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NOVEL MOLECULES OF THE CARD -RELATED PROTEIN 
FAMILY AND USES THEREOF 

Cross Reference to Related Applications 
This application is a continuation-in-part of U.S. 
Application Serial No. 09/207,359 filed December 8, 1998, 
which is a continuation-in-part of U.S. Application 
Serial No. 09/099,041, filed June 17, 1998, which is a 
continuation-in-part of U.S. Application Serial No. 
09/019,942, filed February 6, 1998. The contents of each 
of these applications is incorporated herein by this 
reference . 

Background of the Invention 
In multicellular organisms, homeostasis is 
maintained by balancing the rate of cell proliferation 
against the rate of cell death. Cell proliferation is 
influenced by numerous growth factors and the expression 
of proto-oncogenes, which typically encourage progression 
through the cell cycle. In contrast, numerous events, 
including the expression of tumor suppressor genes, can 
lead to an arrest of cellular proliferation. 

In differentiated cells, a particular type of cell 
death called apoptosis occurs when an internal suicide 
program is activated. This program can be initiated by a 
variety of external signals as well as signals that are 
generated within the cell in response to, for example, 
genetic damage. For many years, the magnitude of 
apoptotic cell death was not appreciated because the 
dying cells are quickly eliminated by phagocytes, without 
an inflammatory response. 

The mechanisms that mediate apoptosis have been 
intensively studied. These mechanisms involve the 
activation of endogenous proteases, loss of mitochondrial 
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function, and structural changes such as disruption of 
the cytoskeleton, cell shrinkage, membrane blebbing, and 
nuclear condensation due to degradation of DNA. The 
various signals that trigger apoptosis are thought to 
bring about these events by converging on a common cell 
death pathway that is regulated by the expression of 
genes that are highly conserved from worms, such as C. 
elegans, to humans. In fact, invertebrate model systems 
have been invaluable tools in identifying and 
characterizing the genes that control apoptosis. Through 
the study of invertebrates and more evolved animals, 
numerous genes that are associated with cell death have 
been identified, but the way in which their products 
interact to execute the apoptotic program is poorly 
understood. 

Caspases, a class of proteins central to the 
apoptotic program, are cysteine protease having 
specificity for aspartate at the substrate cleavage site. 
These proteases are primarily responsible for the 
degradation of cellular proteins that lead to the 
morphological changes seen in cells undergoing apoptosis. 
For example, one of the caspases identified in humans was 
previously known as the interleukin-lo; (IL-lcO converting 
enzyme (ICE) , a cysteine protease responsible for the 
processing of pro-IL-lo; to the active cytokine. 
Overexpression of ICE in Rat-1 fibroblasts induces 
apoptosis (Miura et al . , Cell 75:653, 1993). 

Many caspases and proteins that interact with 
caspases possess domains of about 60 amino acids called a 
caspase recruitment domain (CARD). Hofmann et al . (TIBS 
22:155, 1997) and others have postulated that certain 
apoptotic proteins bind to each other via their CARDs and 
that different subtypes of CARDs may confer binding 
specificity, regulating the activity of various caspases, 
for example. 
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The functional significance of CARDs have been 
demonstrated in recent publications. Duan et al . (Nature 
385:86, 1997) showed that deleting the CARD at the 
N- terminus of RAIDD, a newly identified protein involved 
5 in apoptosis, abolished the ability of RAIDD to bind to 
caspases. In addition, Li et al . (Cell 91:479, 1997) 
showed that the N-terminal 97 amino acids of apoptotic 
protease activating factor- 1 (Apaf-1) was sufficient to 
confer caspase- 9-binding ability. Inohara et al . (J. 
10 Biol. Chem. 273:12296-12300, 1998) showed that Apaf-1 can 
bind several other caspases such as caspase-4 and 
caspase-8. Apaf-1 can interact with caspases via 
CARD-CARD interaction (Li et al . , supra, Hu et al . , PNAS, 
95 :4386-4391, 1998) . 
15 Nuclear factor- kB (NF-kB) is a transcription 

factor expressed in many cell types and which activates 
homologous or heterologous genes that have kB sites in 
their promoters. Quiescent NF-kB resides in the 
cytoplasm as a heterodimer between proteins referred to 
20 as p50 and p65 and is complexed with the regulatory 

protein I/cB. NF-/cB binding to IkB causes NF-kB to remain 
in the cytoplasm. At least two dozen stimuli that 
activate NF-kB are known (New England Journal of Medicine 
336:1066, 1997) and they include cytokines, protein 
25 kinase C activators, oxidants, viruses, and immune system 
stimuli. NF-kB activating stimuli activate specific IkB 
kinases that phosphorylate IkB leading to its 
degradation. Once liberated from IkB, NFkB translocates 
to the nucleus and activates genes with kB sites in their 
30 promoters. How all of these NF-kB activating stimuli act 
is unknown at the present time and it is presumed that 
novel NF- kB pathway components are involved. NF-kB and 
the NF-kB pathway has been implicated in mediating 
chronic inflammation in inflammatory diseases such as 
35 asthma, ulcerative colitis, rheumatoid arthritis (New 
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England Journal of Medicine 336:1066, 1997) and 
inhibiting NF-kB or NF-kB pathways may be an effective 
way of treating these diseases. NF-kB and the NF-kB 
pathway has also been implicated in atherosclerosis 
(American Journal of Cardiology 76:18c, 1995), especially 
in mediating fatty streak formation, and inhibiting NF-kB 
or NF-kB pathways may be an effective therapy for 
atherosclerosis . 

Summary of the Invention 

The present invention is based, at least in part, 
on the discovery of genes encoding CARD-3 and CARD-4. 
The CARD-4 gene can express a long transcript that 
encodes CARD-4L, a short transcript that encodes partial 
CARD - 4 S , or two CARD-4 splice variants. A murine full 
length cDNA sequence for the murine ortholog of CARD-4L 
is also presented. CARD-3 and CARD-4 are intracellular 
proteins that are predicted to be involved in regulating 
caspase activation. CARD-4 is found to activate the 
NF-kB pathway and to enhance caspase 9-mediated cell 
death. In addition, proteins that bind to CARD-4 are 
presented including CARD-3 and hNUDC. 

The CARD-3 cDNA described below (SEQ ID NO:l) has 
a 1620 open reading frame (nucleotides 214 to 1833 of SEQ 
ID NO:l; SEQ ID NO : 3 ) which encodes a 54 0 amino acid 
protein (SEQ ID NO:2). CARD-3 contains a kinase domain 
which extends from amino acid 1 to amino acid 300 of SEQ 
ID NO: 2; SEQ ID NO : 4 , followed by a linker domain at 
amino acid 3 01 to amino acid 431 of SEQ ID NO: 2; SEQ ID 
NO: 5 and a CARD at amino acid 432 to amino acid 54 0 of 
SEQ ID NO: 2; SEQ ID NO: 6. 

At least four forms of CARD-4 exist in the cell, a 
long form, CARD-4L, a short form, CARD-4S, and two splice 
variants, CARD-4Y and CARD-4Z. The cDNA of CARD-4L 
described below (SEQ ID NO: 7) has a 2859 nucleotide open 
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reading frame (nucleotides 245-3103 of SEQ ID NO : 7 ; SEQ 
ID NO: 9) which encodes a 953 amino acid protein (SEQ ID 
NO: 8) . CARD- 4 L protein possesses a CARD domain (amino 
acids 15-114; SEQ ID NO:10). The nucleotide sequence of 
the full length cDNA corresponding to the murine ortholog 
of human CARD-4L is presented (SEQ ID NO: 42) as is the 
predicted amino acid sequence of murine CARD-4L (SEQ ID 
NO: 43) . A comparison between the predicted amino acid 
sequences of human CARD - 4 L and murine CARD - 4 L is also 
depicted in Figure 17. 

Human CARD - 4 L is also predicted to have a 
nucleotide binding domain which extends from about amino 
acid 198 to about amino acid 3 97 of SEQ ID NO: 8; SEQ ID 
NO: 11, a Walker Box "A" , which extends from about amino 
acid 2 02 to about amino acid 209 of SEQ ID NO: 8; SEQ ID 
NO: 12, a Walker Box "B" , which extends from about amino 
acid 2 80 to about amino acid 284, of SEQ ID NO : 8 ; SEQ ID 
NO: 13, a kinase la (P-loop) subdomain, which extends from 
about amino acid 12 7 to about amino acid 212 of SEQ ID 
NO: 8; SEQ ID NO: 46, a kinase 2 subdomain, which extends 
from about amino acid 273 to about amino acid 2 88 of SEQ 
ID NO: 8; SEQ ID NO: 47, a kinase 3a subdomain, which 
extends from about amino acid 327 to about amino acid 338 
of SEQ ID NO: 8; SEQ ID NO: 14, and ten Leucine-rich 
repeats which extend from about amino acid 674 to about 
amino acid 950 of SEQ ID NO: 8. The first Leucine-rich 
repeat extends from about amino acid 674 to about amino 
acid 701 of SEQ ID NO : 8 ; SEQ ID NO: 15. The second 
Leucine-rich repeat extends from about amino acid 702 to 
about amino acid 727 of SEQ ID NO: 8; SEQ ID NO: 16. The 
third Leucine-rich repeat extends from about amino acid 
728 to about amino acid 754 of SEQ ID NO : 8 ; SEQ ID NO: 17. 
The fourth Leucine-rich repeat extends from about amino 
acid 755 to about amino acid 782 of SEQ ID NO: 8; SEQ ID 
NO: 18. The fifth Leucine-rich repeat extends from about 



WO 99/40102 



PCT/US99/02544 



- 6 - 

amino acid 783 to about amino acid 810 of SEQ ID NO: 8; 
SEQ ID NO: 19. The sixth Leucine-rich repeat extends from 
about amino acid 811 to about amino acid 83 8 of SEQ ID 
NO: 8; SEQ ID NO: 20. The seventh Leucine-rich repeat 
5 extends from about amino acid 839 to about amino acid 866 
of SEQ ID NO: 8; SEQ ID NO: 21. The eighth Leucine-rich 
repeat extends from about amino acid 8 67 to about amino 
acid 894 of SEQ ID NO: 8; SEQ ID NO: 22. The ninth 
Leucine-rich repeat extends from about amino acid 895 to 

10 about amino acid 922 of SEQ ID NO : 8 ; SEQ ID NO: 23 and the 
tenth leucine-rich repeat extends from about amino acid 
92 3 to about amino acid 950 of SEQ ID NO: 8; SEQ ID NO: 24. 

The partial cDNA of CARD-4S described below (SEQ 
ID NO: 25) has a 14 7 0 nucleotide open reading frame 

15 (nucleotides 1-1470 of SEQ ID NO: 25; SEQ ID NO: 27) which 
encodes a 490 amino acid protein (SEQ ID NO: 26) . CARD- 4 S 
protein possesses a CARD domain (amino acids 1-74 of SEQ 
ID NO:26; SEQ ID NO:28). CARD - 4 S is predicted to have a 
P-Loop which extends from about amino acid 163 to about 

20 amino acid 170 of SEQ ID NO:26; SEQ ID NO:29, and a 

Walker Box "B" which extends form about amino acid 241 to 
about amino acid 245 of SEQ ID NO: 26; SEQ ID NO: 30. 

A human CARD-4Y nucleotide cDNA sequence is 
presented (SEQ ID NO: 38) as is the amino acid sequence of 

25 the predicted CARD-4Y product (SEQ ID NO: 3 9) . A human 
CARD- 4 Z nucleotide cDNA sequence is presented (SEQ ID 
NO: 40) as is the amino acid sequence of the predicted 
CARD- 4 Z product (SEQ ID NO: 41) . A comparison of the 
CARD-4Y , CARD - 4 Z , and human CARD-4L predicted amino acid 

30 sequences is also shown in Figure 14, 

Like other proteins containing a CARD domain, both 
CARD- 3 and CARD-4 are expected to participate in the 
network of interactions that lead to caspase activity. 
Human CARD-4L is expected to play a functional role in 

35 caspase activation similar to that of Apaf-1 (Zou et al . , 
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Cell, 90:405-413, 1997). For example, upon activation, 
CARD - 4 L might bind a nucleotide, thus allowing CARD - 4 L to 
bind and activate a CARD-containing caspase via a 
CARD-CARD interaction, leading to apoptotic death of the 
5 cell. Accordingly, CARD-3 and CARD-4 molecules are 
useful as modulating agents in regulating a variety of 
cellular processes including cell growth and cell death. 
In one aspect, this invention provides isolated nucleic 
acid molecules encoding CARD-3 or CARD-4 proteins or 

10 biologically active portions thereof, as well as nucleic 
acid fragments suitable as primers or hybridization 
probes for the detection of CARD-3 or CARD-4 encoding 
nucleic acids. 

The invention encompasses methods of diagnosing 

15 and treating patients who are suffering from a disorder 
associated with an abnormal level or rate (undesirably 
high or undesirably low) of apoptotic cell death, 
abnormal activity of the Fas/APO-1 receptor complex, 
abnormal activity of the TNF receptor complex, or 

20 abnormal activity of a caspase by administering a 
compound that modulates the expression of CARD-3 or 
CARD-4 (at the DNA, mRNA or protein level, e.g., by 
altering mRNA splicing) or by altering the activity of 
CARD-3 or CARD-4. Examples of such compounds include 

25 small molecules, antisense nucleic acid molecules, 
ribozymes, and polypeptides. 

Certain disorders are associated with an increased 
number of surviving cells, which are produced and 
continue to survive or proliferate when apoptosis is 

3 0 inhibited. These disorders include cancer (particularly 
follicular lymphomas, carcinomas associated with 
mutations in p53, and hormone -dependent tumors such as 
breast cancer, prostate cancer, and ovarian cancer) , 
autoimmune disorders (such as systemic lupus 

35 erythematosis , immune -mediated glomerulonephritis) , and 
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viral infections (such as those caused by herpesviruses, 
poxviruses, and adenoviruses) . 

Failure to remove autoimmune cells that arise 
during development or that develop as a result of somatic 
5 mutation during an immune response can result in 

autoimmune disease . One of the molecules that plays a 
critical role in regulating cell death in lymphocytes is 
the cell surface receptor for Fas. 

Populations of cells are often depleted in the 

10 event of viral infection, with perhaps the most dramatic 
example being the cell depletion caused by the human 
immunodeficiency virus (HIV) . Surprisingly, most T cells 
that die during HIV infections do not appear to be 
infected with HIV. Although a number of explanations 

15 have been proposed, recent evidence suggests that 

stimulation of the CD4 receptor results in the enhanced 
susceptibility of uninfected T cells to undergo 
apoptosis . 

A wide variety of neurological diseases are 
20 characterized by the gradual loss of specific sets of 
neurons. Such disorders include Alzheimer's disease, 
Parkinson's disease, amyotrophic lateral sclerosis (ALS) 
retinitis pigmentosa, spinal muscular atrophy, and 
various forms of cerebellar degeneration. The cell loss 

2 5 in these diseases does not induce an inflammatory 

response, and apoptosis appears to be the mechanism of 
cell death. 

In addition, a number of hematologic diseases are 
associated with a decreased production of blood cells. 

3 0 These disorders include anemia associated with chronic 

disease, aplastic anemia, chronic neutropenia, and the 
myelodysplastic syndromes. Disorders of blood cell 
production, such as myelodysplastic syndrome and some 
forms of aplastic anemia, are associated with increased 
35 apoptotic cell death within the bone marrow. These 
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disorders could result from the activation of genes that 
promote apoptosis, acquired deficiencies in stromal cells 
or hematopoietic survival factors, or the direct effects 
of toxins and mediators of immune responses. 
5 Two common disorders associated with cell death 

are myocardial infarctions and stroke. In both 
disorders, cells within the central area of ischemia, 
which is produced in the event of acute loss of blood 
flow, appear to die rapidly as a result of necrosis. 

10 However, outside the central ischemic zone, cells die 
over a more protracted time period and morphologically 
appear to die by apoptosis. 

The invention features a nucleic acid molecule 
which is at least 45% (or 55%, 65%, 75%, 85%, 95%, or 

15 98%) identical to the nucleotide sequence shown in SEQ ID 
NO:l, SEQ ID NO : 3 , SEQ ID NO : 7 , SEQ ID NO : 9 , SEQ ID: 25, 
SEQ ID NO: 27, SEQ ID NO: 38, SEQ ID NO: 40, and SEQ ID 
NO: 42, the nucleotide sequence of the cDNA insert of the 
plasmid deposited with ATCC as Accession Number (the 

20 " cDNA of ATCC n ), the nucleotide sequence of the 

cDNA insert of the plasmid deposited with ATCC as 

Accession Number (the " cDNA of ATCC " ) , the 

nucleotide sequence of the cDNA insert of the plasmid 
deposited with ATCC as Accession Number (the " cDNA of 

2 5 ATCC "), or a complement thereof. 

The invention features a nucleic acid molecule 
which includes a fragment of at least 150 (300, 325, 350, 
375, 400, 425, 450, 500, 550, 600, 650, 700, 800, 900, 
1000, 1300, 1600 or 1931) nucleotides of the nucleotide 

3 0 sequence shown in SEQ ID N0:1, or SEQ ID NO : 3 , or the 

nucleotide sequence of the cDNA ATCC , or a 

complement thereof . 

The invention also features a nucleic acid 
molecule which includes a fragment of at least 150 (350, 
35 400, 450, 500, 550, 600, 650, 700, 800, 900, 1000, 1300, 
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1600, 1900, 2100, 2400, 2700, 3000, or 3382) nucleotides 
of the nucleotide sequence shown in SEQ ID NO: 7, SEQ ID 
NO: 9, SEQ ID NO: 4 3 or the nucleotide sequence of the 

cDNA ATCC , or a complement thereof. 

5 Also within the invention is a nucleic acid 

molecule which includes a fragment of at least 150 (350, 
400, 450, 500, 550, 600, 650, 700, 800, 900, 1000, 1300, 
1600, 1900, 2100, 2400, 2700, and 3080) nucleotides of 
the nucleotide sequence shown in SEQ ID NO: 25, SEQ ID 
10 NO:27, SEQ ID NO:38, SEQ ID N0:40, or the nucleotide 

sequence of the cDNA ATCC , or a complement 

thereof . 

The invention features a nucleic acid molecule 
which includes a nucleotide sequence encoding a protein 
15 having an amino acid sequence that is at least 45% (or 
55%, 65%, 75%, 85%, 95%, or 98%) identical to the amino 
acid sequence of SEQ ID NO: 2, SEQ ID NO: 8, SEQ ID NO: 26, 
SEQ ID NO:39, SEQ ID NO:41, and SEQ ID NO:43, or the 
amino acid sequence encoded by the cDNA of ATCC , the 

2 0 amino acid sequence encoded by the cDNA of ATCC , or 

the amino acid sequence encoded by the cDNA of ATCC . 

In an embodiment, a CARD-3 nucleic acid molecule 
has the nucleotide sequence shown in SEQ ID NO:l, or SEQ 
ID NO: 3, or the nucleotide sequence of the cDNA of ATCC 

25 . In another embodiment, a CARD-4L nucleic acid 

molecule has the nucleotide sequence shown in SEQ ID 
NO: 7, or SEQ ID NO : 9 , or the nucleotide sequence of the 

cDNA of ATCC . In yet another embodiment, a CARD-4S 

nucleic acid molecule has the nucleotide sequence shown 

3 0 in SEQ ID NO: 25, or SEQ ID NO: 27, or the nucleotide 

sequence of the cDNA of ATCC . In another 

embodiment, a murine CARD-4L nucleic acid molecule has 
the nucleotide sequence shown in SEQ ID NO: 42. In 
another embodiment, a CARD-4Y nucleic acid molecule has 
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the nucleotide sequence shown in SEQ ID NO: 38 or the 

nucleotide sequence of the cDNA of ATCC . In another 

embodiment, a CARD - 4 Z nucleic acid molecule has the 
nucleotide sequence shown in SEQ ID NO: 40 or the 

5 nucleotide sequence of the cDNA of ATCC . 

Also within the invention is a nucleic acid 
molecule which encodes a fragment of a polypeptide having 
the amino acid sequence of SEQ ID NO : 2 or SEQ ID NO : 8 or 
SEQ ID NO: 26 or SEQ ID NO : 3 9 or SEQ ID NO: 41 or SEQ ID 

10 NO:43, the fragment including at least 15 (25, 30, 50, 

100, 150, 300, 400 or 540, 600, 700, 800, 953) contiguous 
amino acids of SEQ ID NO: 2 or SEQ ID NO : 8 or SEQ ID NO: 26 
or SEQ ID NO: 39 or SEQ ID NO: 41 or SEQ ID NO: 43 or the 
polypeptide encoded by the cDNA of ATCC Accession Number 

15 , or the polypeptide encoded by the cDNA of ATCC 

Accession Number or the polypeptide encoded by the 

cDNA of ATCC 

Accession Number . 

The invention includes a nucleic acid molecule 

20 which encodes a naturally occurring allelic variant of a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2 or SEQ ID NO: 39 or SEQ ID NO: 41 or SEQ ID NO: 43 or 
an amino acid sequence encoded by the cDNA of ATCC 
Accession Number , wherein the nucleic acid molecule 

25 hybridizes to a nucleic acid molecule comprising SEQ ID 
NO:l or SEQ ID NO : 3 or SEQ ID NO: 38 or SEQ ID NO: 40 or 
SEQ ID NO:42 under stringent conditions. The invention 
also includes a nucleic acid molecule which encodes a 
naturally occurring allelic variant of a polypeptide 

30 comprising the amino acid sequence of SEQ ID NO : 8 or an 
amino acid sequence encoded by the cDNA of ATCC Accession 

Number , wherein the nucleic acid molecule 

hybridizes to a nucleic acid molecule comprising SEQ ID 
NO: 7 or SEQ ID NO : 9 under stringent conditions. 
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The invention also includes a nucleic acid 
molecule which encodes a naturally occurring allelic 
variant of a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 26 or an amino acid sequence 

5 encoded by the cDNA of ATCC Accession Number , 

wherein the nucleic acid molecule hybridizes to a nucleic 
acid molecule comprising SEQ ID NO: 25 or SEQ ID NO: 27 
under stringent conditions. In general, an allelic 
variant of a gene will be readily identifiable as mapping 

10 to the same chromosomal location as said gene. For 

example, in Example 6, the chromosomal location of the 
human CARD-4 gene is discovered to be chromosome 7 close 
to the SHGC-31928 genetic marker. Allelic variants of 
human CARD-4 will be readily identifiable as mapping to 

15 the human CARD-4 locus on chromosome 7 near genetic 
marker SHGC-3192 8. 

Also within the invention are: an isolated CARD-3 
protein having an amino acid sequence that is at least 
about 65%, preferably 75%, 85%, 95%, or 98% identical to 

2 0 the amino acid sequence of SEQ ID NO : 2 ; an isolated 

CARD-3 protein having an amino acid sequence that is at 
least about 85%, 95%, or 98% identical to the kinase 
domain of SEQ ID NO:2 (e.g., about amino acid residues 1 
to 3 00 of SEQ ID NO: 2; SEQ ID NO: 4) ; and an isolated 

25 CARD-3 protein having an amino acid sequence that is at 
least about 85%, 95%, or 98% identical to the linker 
domain of SEQ ID NO:2 (e.g., about amino acid residues 
301 to 431 of SEQ ID NO : 2 ; SEQ ID NO:5); an isolated 
CARD-3 protein having an amino acid sequence that is at 

30 least about 85%, 95%, or 98% identical to the CARD domain 
of SEQ ID NO:2 (e.g., about amino acid residues 432 to 
540 of SEQ ID NO:2; SEQ ID NO:6); an isolated CARD-4L 
protein having an amino acid sequence that is at least 
about 65%, preferably 75%, 85%, 95%, or 98% identical to 

35 the amino acid sequence of SEQ ID NO: 8; an isolated 
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CARD - 4 L protein having an amino acid sequence that is at 
least about 85%, 95%, or 98% identical to the CARD domain 
of SEQ ID NO: 8 (e.g., about amino acid residues 15 to 114 
of SEQ ID NO:8; SEQ ID NO:10); an isolated CARD - 4 L 
5 protein having an amino acid sequence that is at least 
about 85%, 95%, or 98% identical to the nucleotide 
binding domain of SEQ ID NO:8 (e.g., about amino acid 
residues 198 to 397 of SEQ ID NO:8; SEQ ID NO: 11; an 
isolated CARD-4L protein having an amino acid sequence 
10 that is at least about 85%, 95%, or 98% identical to the 
kinase la (P-loop) subdomain SEQ ID NO : 8 (e.g., about 
amino acid 127 to about amino acid 212 of SEQ ID NO : 8 ; 
SEQ ID NO:46); an isolated CARD-4L protein having an 
amino acid sequence that is at least about 85%, 95%, or 
15 98% identical to the kinase 2 subdomain of SEQ ID NO : 8 
(e.g., about amino acid 273 to about amino acid 288 of 
SEQ ID NO:8; SEQ ID NO : 4 7 ) ; an isolated CARD - 4 L protein 
having an amino acid sequence that is at least about 85%, 
95%, or 98% identical to a kinase 3a subdomain of SEQ ID 
2 0 NO: 8 (e.g., about amino acid residues 32 7 to 338 of SEQ 
ID NO:8; SEQ ID NO:14); an isolated CARD-4L. protein 
having an amino acid sequence that is at least about 85%, 
95%, or 98% identical to the Leucine-rich repeats of SEQ 
ID NO:8 (e.g., about amino acid residues 674 to 701 of 
25 SEQ ID NO: 8; SEQ ID NO: 15; from amino acid 702 to amino 

acid 727 of SEQ ID NO: 8; SEQ ID NO: 16; which extends from 
amino acid 72 8 to amino acid 754 SEQ ID NO: 8; SEQ ID 
NO: 17; from amino acid 755 to amino acid 782 of SEQ ID 
NO: 8; SEQ ID NO: 18; from amino acid 783 to amino acid 810 
30 of SEQ ID NO: 8; SEQ ID NO: 19; from amino acid 811 to 
amino acid 83 8 of SEQ ID NO: 8; SEQ ID NO: 20 from amino 
acid 839 to amino acid 866 of SEQ ID NO : 8 ; SEQ ID NO:21; 
from amino acid 867 to amino acid 894 of SEQ ID NO: 8; SEQ 
ID NO: 22; from amino acid 895 to amino acid 922 of SEQ ID 
35 NO: 8; SEQ ID NO: 23; and from amino acid 923 to amino acid 
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950 of SEQ ID NO:8; SEQ ID NO:24); an isolated CARD-4S 
protein having an amino acid sequence that is at least 
about 65%, preferably 75%, 85%, 95%, or 98% identical to 
the amino acid sequence of SEQ ID NO: 26; an isolated 
5 CARD - 4 S J protein having an amino acid sequence that is at 
least about 85%, 95%, or 98% identical to the CARD domain 
of SEQ ID NO:26 (e.g., about amino acid residues 1 to 74 
of SEQ ID NO: 26; SEQ ID NO: 28) . Also within the invention 
are: an isolated murine CARD-4L protein having an amino 

10 acid sequence that is at least about 65%, preferably 75%, 
85%, 95%, or 98% identical to the amino acid sequence of 
SEQ ID NO: 43. Also within the invention are: an isolated 
CARD-4Y protein having an amino acid sequence that is at 
least about 65%, preferably 75%, 85%, 95%, or 98% 

15 identical to the amino acid sequence of SEQ ID NO: 39. 
Also within the invention are: an isolated CARD- 4 Z 
protein having an amino acid sequence that is at least 
about 65%, preferably 75%, 85%, 95%, or 98% identical to 
the amino acid sequence of SEQ ID NO: 41. 

20 Also within the invention are: an isolated CARD-3 
protein which is encoded by a nucleic acid molecule 
having a nucleotide sequence that is at least about 65%, 
preferably 75%, 85%, or 95% identical to SEQ ID NO : 3 or 
the cDNA of ATCC ; an isolated CARD-3 protein which 

2 5 is encoded by a nucleic acid molecule having a nucleotide 
sequence at least about 65% preferably 75%, 85%, or 95% 
identical to the kinase domain encoding portion of SEQ ID 
NO:l (e.g., about nucleotides 213 to 1113 of SEQ ID 
NO:l); an isolated CARD-3 protein which is encoded by a 

30 nucleic acid molecule having a nucleotide sequence at 

least about 65% preferably 75%, 85%, or 95% identical the 
linker domain encoding portion of SEQ ID NO:l (e.g., 
about nucleotides 1114 to 1506 of SEQ ID NO:l); and an 
isolated CARD-3 protein which is encoded by a nucleic 

35 acid molecule having a nucleotide sequence at least about 
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65% preferably 75%, 85%, or 95% identical the CARD domain 
encoding portion of SEQ ID NO : 1 (e.g., about nucleotides 
1507 to 1833 of SEQ ID NO : 1 ) ; and an isolated CARD-3 
protein which is encoded by a nucleic acid molecule 
5 having a nucleotide sequence which hybridizes under 
stringent hybridization conditions to a nucleic acid 
molecule having the nucleotide sequence of SEQ ID NO : 3 or 

the non- coding strand of the cDNA of ATCC . Also 

within the invention are: an isolated CARD- 4 Y protein 

10 which is encoded by a nucleic acid molecule having a 
nucleotide sequence that is at least about 65%, 
preferably 75%, 85%, or 95% identical to SEQ ID NO:38 or 

the cDNA of ATCC . Also within the invention are 

nucleic acid molecules which include about nucleotides 

15 2759 to 2842 of SEQ ID NO : 7 ; about nucleotides 2843 to 
2 92 6 of SEQ ID NO : 7 ; about nucleotides 2 92 7 to 3 010 of 
SEQ ID NO:7; about nucleotides 3011 to 3 094 of SEQ ID 
NO: 7; and an isolated CARD - 4 L protein which is encoded by 
a nucleic acid molecule having a nucleotide sequence 

20 which hybridizes under stringent hybridization conditions 
to a nucleic acid molecule having the nucleotide sequence 
of SEQ ID NO: 9 or the non- coding strand of the cDNA of 
ATCC . 

Also within the invention are: an isolated CARD-4S 
25 protein which is encoded by a nucleic acid molecule 

having a nucleotide sequence that is at least about 65%, 
preferably 75%, 85%, or 95% identical to SEQ ID NO: 27 or 

the cDNA of ATCC ; an isolated CARD-3 protein which 

is encoded by a nucleic acid molecule having a nucleotide 
30 sequence at least about 65% preferably 75%, 85%, or 95% 
identical the CARD domain encoding portion of SEQ ID 
NO:25 (e.g., about nucleotides 1 to 222 of SEQ ID NO:25); 
an isolated CARD-3 protein which is encoded by a nucleic 
acid molecule having a nucleotide sequence at least about 
35 65% preferably 75%, 85%, or 95% identical the P-Loop 
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encoding portion of SEQ ID NO:25 (e.g., about nucleotides 
485 to 510 Of SEQ ID NO:25) . 

Also within the invention is a polypeptide which 
is a naturally occurring allelic variant of a polypeptide 
5 that includes the amino acid sequence of SEQ ID NO: 2 or 
an amino acid sequence encoded by the cDNA insert of the 

plasmid deposited with ATCC as Accession Number , 

wherein the polypeptide is encoded by a nucleic acid 
molecule which hybridizes to a nucleic acid molecule 
10 comprising SEQ ID NO:l or SEQ ID NO : 3 under stringent 
conditions . 

Also within the invention is a polypeptide which 
is a naturally occurring allelic variant of a polypeptide 
that includes the amino acid sequence of SEQ ID NO: 8 or 

15 an amino acid sequence encoded by the cDNA insert of the 

plasmid deposited with ATCC as Accession Number 

_, wherein the polypeptide is encoded by a nucleic acid 
molecule which hybridizes to a nucleic acid molecule 
comprising SEQ ID NO : 7 or SEQ ID NO: 9 under stringent 

20 conditions . 

Also within the invention is a polypeptide which 
is a naturally occurring allelic variant of a polypeptide 
that includes the amino acid sequence of SEQ ID NO: 26 or 
an amino acid sequence encoded by the cDNA insert of the 

25 plasmid deposited with ATCC as Accession Number , 

wherein the polypeptide is encoded by a nucleic acid 
molecule which hybridizes to a nucleic acid molecule 
comprising SEQ ID NO: 25 or SEQ ID NO: 27 under stringent 
conditions . 

3 0 Another embodiment of the invention features 

CARD-3 or CARD-4 nucleic acid molecules which 
specifically detect CARD-3 or CARD-4 nucleic acid 
molecules, relative to nucleic acid molecules encoding 
other members of the CARD superfamily. For example, in 

35 one embodiment, a CARD-4L nucleic acid molecule 
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hybridizes under stringent conditions to a nucleic acid 
molecule comprising the nucleotide sequence of SEQ ID 

NO: 7, SEQ ID NO : 9 , or the cDNA of ATCC , or a 

complement thereof. In another embodiment, the CARD - 4 L 

5 nucleic acid molecule is at least 300 (350, 400, 450, 
500, 550, 600, 650, 700, 800, 900, 1000, 1300, 1600, 
1900, 2100, 2400, 2700, 3000, or 3382) nucleotides in 
length and hybridizes under stringent conditions to a 
nucleic acid molecule comprising the nucleotide sequence 

0 shown in SEQ ID NO: 7, SEQ ID NO: 9, the cDNA of ATCC 

, or a complement thereof. In another embodiment, 

an isolated CARD- 4 L nucleic acid molecule comprises 
nucleotides 287 to 586 of SEQ ID NO: 7, encoding the CARD 
domain of CARD - 4 L , or a complement thereof. In yet 

5 another embodiment, the invention provides an isolated 
nucleic acid molecule which is antisense to the coding 
strand of a CARD- 4 L nucleic acid. 

Another aspect of the invention provides a vector, 
e.g., a recombinant expression vector, comprising a 

0 CARD-3 or a CARD-4L nucleic acid molecule of the 

invention. In another embodiment the invention provides 
a host cell containing such a vector. The invention also 
provides a method for producing CARD-3 or CARD- 4 protein 
by culturing, in a suitable medium, a host cell of the 

5 invention containing a recombinant expression vector such 
that a CARD-3 or CARD-4 protein is produced. 

Another aspect of this invention features isolated 
or recombinant CARD-3 or CARD-4 proteins and 
polypeptides. Preferred CARD-3 or CARD-4 proteins and 

0 polypeptides possess at least one biological activity 
possessed by naturally occurring human CARD-3 or CARD-4, 
e.g., (1) the ability to 

form protein: protein interactions with proteins in the 
apoptotic signalling pathway; (2) the ability to form 
5 CARD-CARD interactions with proteins in the apoptotic 
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signaling pathway; (3) the ability to bind the CARD-3 or 
CARD-4 ligand; (4) and the ability to bind to an 
intracellular target. Other activities include: (1) 
modulation of cellular proliferation, (2) modulation of 
5 cellular differentiation and (3) modulation of cellular 
death (4) modulation of the NF-kB pathway. 

The CARD-3 or CARD-4 proteins of the present 
invention, or biologically active portions thereof, can 
be operatively linked to a non-CARD-3 or non-CARD-4 

10 polypeptide (e.g., heterologous amino acid sequences) to 
form CARD-3 or CARD-4 fusion proteins, respectively. The 
invention further features antibodies that specifically 
bind CARD-3 or CARD-4 proteins, such as monoclonal or 
polyclonal antibodies. In addition, the CARD-3 or CARD-4 

15 proteins or biologically active portions thereof can be 
incorporated into pharmaceutical compositions, which 
optionally include pharmaceutically acceptable carriers. 

In another aspect, the present invention provides 
a method for detecting the presence of CARD-3 or CARD-4 

20 activity or expression in a biological sample by 

contacting the biological sample with an agent capable of 
detecting an indicator of CARD-3 or CARD-4 activity such 
that the presence of CARD-3 or CARD-4 activity is 
detected in the biological sample. 

25 In another aspect, the invention provides a method 

for modulating CARD-3 or CARD-4 activity comprising 
contacting a cell with an agent that modulates (inhibits 
or stimulates) CARD-3 or CARD-4 activity or expression 
such that CARD-3 or CARD-4 activity or expression in the 

3 0 cell is modulated. In one embodiment, the agent is an 
antibody that specifically binds to CARD-3 or CARD-4 
protein. In another embodiment, the agent modulates 
expression of CARD-3 or CARD-4 by modulating 
transcription of a CARD-3 or CARD-4 gene, splicing of a 
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CARD- 3 or CARD-4 mRNA, or translation of a CARD- 3 or 
CARD- 4 mRNA. In yet another embodiment, the agent is a 
nucleic acid molecule having a nucleotide sequence that 
is antisense to the coding strand of the CARD-3 or CARD-4 
5 mRNA or the CARD -3 or CARD-4 gene. 

In one embodiment, the methods of the present 
invention are used to treat a subject having a disorder 
characterized by aberrant CARD-3 or CARD-4 protein or 
nucleic acid expression or activity by administering an 
10 agent which is a CARD-3 or CARD-4 modulator to the 
subject. In one embodiment, the CARD-3 or CARD-4 
modulator is a CARD-3 or CARD-4 protein. In another 
embodiment the CARD-3 or CARD-4 modulator is a CARD-3 or 
CARD-4 nucleic acid molecule. In other embodiments, the 
15 CARD-3 or CARD-4 modulator is a peptide, pept idomimetic , 
or other small molecule. 

The present invention also provides a diagnostic 
assay for identifying the presence or absence of a 
genetic lesion or mutation characterized by at least one 
20 of: (i) aberrant modification or mutation of a gene 

encoding a CARD-3 or CARD-4 protein; (ii) mis -regulation 
of a gene encoding a CARD-3 or CARD-4 protein; (iii) 
aberrant RNA splicing; and (iv) aberrant 
post-translational modification of a CARD-3 or CARD-4 
25 protein, wherein a wild-type form of the gene encodes a 
protein with a CARD-3 or CARD-4 activity. 

In another aspect, the invention provides a 
method for identifying a compound that binds to or 
modulates the activity of a CARD-3 or CARD-4 protein. In 
30 general, such methods entail measuring a biological 

activity of a CARD-3 or CARD-4 protein in the presence 
and absence of a test compound and identifying those 
compounds which alter the activity of the CARD-3 or 
CARD-4 protein. 
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The invention also features methods for 
identifying a compound which modulates the expression of 
CARD- 3 or CARD- 4 by measuring the expression of CARD- 3 or 
CARD- 4 in the presence and absence of a compound. 
5 Other features and advantages of the invention 

will be apparent from the following detailed description 
and claims. 



Brief Description of the Drawings 
Figure 1 depicts the cDNA sequence (SEQ ID NO:l) 
10 of human CARD-3. The open reading frame of CARD-3 (SEQ 
ID NO:l) extends from nucleotide 213 to nucleotide 1833 
nucleotide (SEQ ID N0:3). 

Figure 2 depicts the predicted amino acid sequence 
(SEQ ID NO: 2) of human CARD- 3. 
15 Figure 3 depicts the cDNA sequence (SEQ ID NO: 7) 

of CARD - 4 L . The open reading frame of SEQ ID NO: 7 
extends from nucleotide 245 to nucleotide 3103 (SEQ ID 
NO : 9 ) . 

Figure 4 depicts the predicted amino acid sequence 
2 0 (SEQ ID NO: 8) of human CARD-4L. 

Figure 5 depicts the partial cDNA sequence (SEQ ID 
NO: 25) of CARD-4S and the predicted amino acid sequence 
(SEQ ID NO: 25) of human CARD-4S. The open reading frame 
of CARD -4 (SEQ ID NO: 25) extends from nucleotide 1 to 
25 nucleotide 1470 (SEQ ID NO:27) . 

Figure 6 depicts the predicted amino acid sequence 
(SEQ ID NO: 26) of human CARD-4S. 

Figure 7 depicts an alignment of the CARD domains 
of CARD-4 (SEQ ID NO:10), CARD-3 (SEQ ID NO:6), ARC-CARD 
30 (SEQ ID NO:31), cIAPl-CARD (SEQ ID NO:32) and CIAP2 -CARD 
(SEQ ID NO:33) . 
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Figure 8 is a plot showing predicted structural 
features of human CARD-4L. 

Figure 9 is a plot showing predicted structural 
features of human CARD - 4 S . 
5 Figure 10 depicts the cDNA sequence (SEQ ID NO: 38) 

of the human CARD - 4 Y splice variant clone. The predicted 
open reading frame of the human CARD- 4 Y splice variant 
clone extends from nucleotide 438 to nucleotide 1184. 

Figure 11 depicts the amino acid sequence (SEQ ID 
10 NO: 3 9) of the protein predicted to be encoded by the 
human CARD - 4 Y cDNA open reading frame. 

Figure 12 depicts the cDNA sequence (SEQ ID NO: 40) 
of the human CARD-4Z splice variant clone. The predicted 
open reading frame of the human CARD- 4 Z splice variant 
15 clone extends from nucleotide 489 to nucleotide 980. 

Figure 13 depicts the amino acid sequence (SEQ ID 
NO: 41) of the protein predicted to be encoded by the 
human CARD- 4 Z cDNA open reading frame. 

Figure 14 depicts an alignment of human CARD- 4 L 
20 (SEQ ID NO:8), the predicted amino acid sequence of human 
CARD - 4 Y (SEQ ID NO:39), and the predicted amino acid 
sequence of human CARD- 4 Z (SEQ ID NO: 41) . 

Figure 15 depicts the nucleotide sequence of the 
murine CARD-4L cDNA (SEQ ID NO: 42) . 
25 Figure 16 depicts the predicted amino acid 

sequence of murine CARD-4L (SEQ ID NO:43). 

Figure 17 depicts an alignment of human CARD- 4 L 
(SEQ ID NO: 8) and the predicted amino acid sequence of 
murine CARD-4L (SEQ ID NO: 43) . 
30 Figure 18 depicts a 32042 nucleotide genomic 

sequence of CARD-4. 



Detailed Description of the Invention 
The present invention is based, in part, on the 
discovery of cDNA molecules encoding human CARD-3, human 



WO 99/40102 



PCT/US99/02544 



- 22 - 

CARD-4 and partial murine CARD-4L proteins. A nucleotide 
sequence encoding a human CARD- 3 protein is shown in 
Figure 1 (SEQ ID NO:l; SEQ ID NO : 3 includes the open 
reading frame only) . A predicted amino acid sequence of 
5 CARD-3 protein is also shown in Figure 2 (SEQ ID NO: 2) . 
CARD-4 has at least two forms, a long form, CARD-4L, and 
a short form, CARD-4S, as well as two or more splice 
variants. A nucleotide sequence encoding a human CARD-4L 
protein is shown in Figure 3 (SEQ ID NO: 7; SEQ ID NO: 9 

10 includes the open reading frame only) . A predicted amino 
acid sequence of CARD-4L protein is also shown in Figure 
4 (SEQ ID NO: 8) . A nucleotide sequence encoding a human 
CARD-4S protein is shown in Figure 5 (SEQ ID NO: 25; SEQ 
ID NO: 27 includes the open reading frame only) . A 

15 predicted amino acid sequence of CARD -4 S protein is also 
shown in Figure 6 (SEQ ID NO: 26) . Two additional splice 
variants of human CARD-4 are provided in Figures 10 and 
11 (human CARD-4Y) and Figures 12 and 13 (human CARD-4Z) 
(predicted amino acid sequences: SEQ ID NO: 39 and SEQ ID 

2 0 NO: 41 and nucleic acid sequences: SEQ ID NO: 38 and SEQ ID 

NO: 40) . These two splice variants are predicted to 
contain 24 9 and 164 amino acids, respectively. An 
alignment of human CARD-4 Y, human CARD-4 Z and human 
CARD-4L is shown in Figure 14. 
25 In addition to the human CARD-4 proteins, a full 

length nucleotide sequence of the murine ortholog of 
human CARD-4L is provided in Figure 15 (SEQ ID NO: 42) . An 
alignment of murine CARD - 4 L with human CARD-4L is shown 
in Figure 17. 

3 0 The human CARD- 3 cDNA of Figure 1 (SEQ ID N0:1) , 

which is approximately 1931 nucleotides long including 
untranslated regions, encodes a protein amino acid having 
a molecular weight of approximately 61 kDa (excluding 
post-translational modifications) . A plasmid containing 
35 a cDNA encoding human CARD- 3 (with the cDNA insert name 
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of) was deposited with American Type Culture Collection 

(ATCC) , Manasass, VA on and assigned Accession 

Number . This deposit will be maintained under 

the terms of the Budapest Treaty on the International 
Recognition of the Deposit of Microorganisms for the 
Purposes of Patent Procedure. This deposit was made 
merely as a convenience for those of skill in the art and 
is not an admission that a deposit is required under 35 
U.S.C. §112. 

The human CARD - 4 L cDNA of Figure 3 (SEQ ID NO:7), 
which is approximately 3 3 82 nucleotides long including 
untranslated regions, encodes a protein amino acid having 
a molecular weight of approximately 108 kDa (excluding 
post- translational modifications) . A plasmid containing 
a cDNA encoding human CARD-4L (with the cDNA insert name 

of ) was deposited with American Type Culture 

Collection (ATCC) , Manasass, VA on and assigned 

Accession Number . This deposit will be 

maintained under the terms of the Budapest Treaty on the 
0 International Recognition of the Deposit of 

Microorganisms for the Purposes of Patent Procedure. 
This deposit was made merely as a convenience for those 
of skill in the art and is not an admission that a 
deposit is required under 35 U.S.C. §112. 
5 The human CARD- 4 S cDNA of Figure 5 (SEQ ID NO: 25) , 

which is 3082 nucleotides long including untranslated 
regions . A plasmid containing a cDNA encoding human 

CARD - 4 S (with the cDNA insert name of ) was 

deposited with American Type Culture Collection (ATCC) , 

0 Manasass, VA on and assigned Accession Number 

This deposit will be maintained under the terms of the 
Budapest Treaty on the International Recognition of the 
Deposit of Microorganisms for the Purposes of Patent 
Procedure. This deposit was made merely as a convenience 
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for those of skill in the art and is not an admission 
that a deposit is required under 35 U.S.C. §112. 

A region of human CARD-4L protein (SEQ ID NO: 8) 
bears some similarity to a CARD domain of CARD-3 (SEQ ID 
5 NO:6), ARC-CARD (SEQ ID NO : 3 1 ) , cIAPl-CARD (SEQ ID 

NO: 32) , and CIAP2-CARD (SEQ ID NO: 33) . This comparison 
is depicted in Figure 7. 

Human CARD-3 or CARD -4 are members of a family of 
molecules (the "CARD family") having certain conserved 

10 structural and functional features. The term "family" 
when referring to the protein and nucleic acid molecules 
of the invention is intended to mean two or more proteins 
or nucleic acid molecules having a common structural 
domain and having sufficient amino acid or nucleotide 

15 sequence identity as defined herein. Such family members 
can be naturally occurring and can be from either the 
same or different species. For example, a family can 
contain a first protein of human origin and a homologue 
of that protein of murine origin, as well as a second, 

20 distinct protein of human origin and a murine homologue 
of that protein. Members of a family may also have 
common functional characteristics. 

In one embodiment, a CARD-3 or CARD- 4 protein 
includes a CARD domain having at least about 65% , 

2 5 preferably at least about 75%, and more preferably about 

85%, 95%, or 98% amino acid sequence identity to the CARD 
domain of SEQ ID NO: 6 or the CARD domain of SEQ ID NO: 10 
or the CARD domain of SEQ ID NO:28. 

Preferred CARD-3 or CARD-4 polypeptides of the 
30 present invention have an amino acid sequence 

sufficiently identical to the CARD domain consensus amino 
acid sequence of SEQ ID NO: 6, SEQ ID NO: 10, SEQ ID NO: 28, 
respectively. The CARD-3 polypeptide also has an amino 
acid sequence sufficiently identical to the kinase domain 

3 5 consensus sequence of SEQ ID NO:4, and an amino acid 
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sequence that is sufficiently identical to the linker 
domain of SEQ ID NO : 5 . The CARD - 4 L polypeptide has an 
amino acid sequence sufficiently identical to the 
nucleotide binding domain of SEQ ID NO: II, an amino acid 
5 sequence sufficiently identical to the Walker Box "A" of 
SEQ ID NO: 12 or Walker Box "B" of SEQ ID NO: 13, an amino 
acid sequence sufficiently identical to the kinase la 
subdomain of SEQ ID NO: 46, an amino acid sequence 
sufficiently identical to the kinase 2 subdomain of SEQ 

10 ID NO: 47, or an amino acid sequence sufficiently 

identical to the kinase 3a subdomain of SEQ ID NO: 14, or 
an amino acid sequence sufficiently identical to the 
Leucine-rich repeats of SEQ ID NO: 14, SEQ ID NO: 15, SEQ 
ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ 

15 ID NO:20, SEQ ID NO : 2 1 , SEQ ID NO:22, and SEQ ID NO:23. 
As used herein, the term 11 sufficiently identical" refers 
to a first amino acid or nucleotide sequence which 
contains a sufficient or minimum number of identical or 
equivalent (e.g., an amino acid residue which has a 

20 similar side chain) amino acid residues or nucleotides to 
a second amino acid or nucleotide sequence such that the 
first and second amino acid or nucleotide sequences have 
a common structural domain and/or common functional 
activity. For example, amino acid or nucleotide 

2 5 sequences which contain a common structural domain having 

about 65% identity, preferably 75% identity, more 
preferably 85%, 95%, or 98% identity are defined herein 
as sufficiently identical. 

As used interchangeably herein a 11 CARD- 3 or CARD- 4 
30 activity", "biological activity of CARD-3 or CARD-4" or 
"functional activity of CARD-3 or CARD-4", refers to an 
activity exerted by a CARD-3 or CARD-4 protein, 
polypeptide or nucleic acid molecule on a CARD-3 or 
CARD-4 responsive cell as determined in vivo, or in 

3 5 vitro, according to standard techniques. A CARD - 3 or 
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CARD-4 activity can be a direct activity, such as an 
association with or an enzymatic activity on a second 
protein or an indirect activity, such as a cellular 
signaling activity mediated by interaction of the CARD- 3 
5 or CARD-4 protein with a second protein. In an 

embodiment, a CARD-3 or CARD-4 activity includes at least 
one or more of the following activities: (i) interaction 
with proteins in the apoptotic signalling pathway (ii) 
interaction with a CARD-3 or CARD-4 ligand; or (iii) 

10 interaction with an intracellular target protein; (iv) 
indirect interaction with caspases. For example, in 
Example 4, CARD- 3 -containing proteins were shown to 
associate with CARD-4 -containing proteins. In example 9, 
CARD-4 proteins were shown to induce NF-eB-mediated 

15 transcription. In example 10, CARD-3 and CARD-4 were 
shown to enhance caspase 9 activity. 

Accordingly, another embodiment of the invention 
features isolated CARD-3 or CARD-4 proteins and 
polypeptides having a CARD-3 or CARD-4 activity. 

20 Various aspects of the invention are described in 

further detail in the following subsections. 

X* Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated 
nucleic acid molecules that encode CARD-3 or CARD-4 

25 proteins or biologically active portions thereof, as well 
as nucleic acid molecules sufficient for use as 
hybridization probes to identify CARD-3 or 
CARD-4 -encoding nucleic acids (e.g., CARD-3 or CARD-4 
mRNA) and fragments for use as PCR primers for the 

30 amplification or mutation of CARD-3 or CARD-4 nucleic 
acid molecules. As used herein, the term "nucleic acid 
molecule" is intended to include DNA molecules (e.g., 
cDNA or genomic DNA) and RNA molecules (e.g., mRNA) and 
analogs of the DNA or RNA generated using nucleotide 
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analogs. The nucleic acid molecule can be 
single-stranded or double -stranded, but preferably is 
double -stranded DNA. 

An "isolated 11 nucleic acid molecule is one which 
is separated from other nucleic acid molecules which are 
present in the natural source of the nucleic acid. 
Preferably, an "isolated" nucleic acid is free of 
sequences (preferably protein encoding sequences) which 
naturally flank the nucleic acid (i.e., sequences located 
at the 5' and 3' ends of the nucleic acid) in the genomic 
DNA of the organism from which the nucleic acid is 
derived. For example, in various embodiments, the 
isolated CARD-3 or CARD-4L/S nucleic acid molecule can 
contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 
kb or 0 . 1 kb of nucleotide sequences which naturally 
flank the nucleic acid molecule in genomic DNA of the 
cell from which the nucleic acid is derived. Moreover, 
an "isolated" nucleic acid molecule, such as a cDNA 
molecule, can be substantially free of other cellular 
material, or culture medium when produced by recombinant 
techniques, or substantially free of chemical precursors 
or other chemicals when chemically synthesized. 

A nucleic acid molecule of the present invention, 
e.g., a nucleic acid molecule having the nucleotide 
sequence of SEQ ID NO:l, SEQ ID NO : 3 , SEQ ID NO: 7, SEQ ID 
NO:9, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:38, SEQ ID 

NO: 40, SEQ ID NO: 42, the cDNA of ATCC , the cDNA of 

ATCC , or a complement of any of these nucleotide 

sequences, can be isolated using standard molecular 
biology techniques and the sequence information provided 
herein. Using all or portion of the nucleic acid 
sequences of SEQ ID NO : 1 , SEQ ID NO: 3, SEQ ID NO : 7 , SEQ 
ID NO: 9, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 38, SEQ ID 

NO: 40, SEQ ID NO: 42, the cDNA of ATCC or the cDNA 

of ATCC , as a hybridization probe, CARD-3 or CARD-4 
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nucleic acid molecules can be isolated using standard 
hybridization and cloning techniques (e.g., as described 
in Sambrook et al . , eds . , Molecular Cloning: A 
Laboratory Manual. 2nd, ed. , Cold Spring Harbor 
5 Laboratory, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY, 1989) . 

A nucleic acid of the invention can be amplified 
using cDNA, mRNA or genomic DNA as a template and 
appropriate oligonucleotide primers according to standard 

10 PCR amplification techniques. The nucleic acid so 

amplified can be cloned into an appropriate vector and 
characterized by DNA sequence analysis. Furthermore, 
oligonucleotides corresponding to CARD-3 or CARD-4 
nucleotide sequences can be prepared by standard 

15 synthetic techniques, e.g., using an automated DNA 
synthesizer. 

In another embodiment, an isolated nucleic acid 
molecule of the invention comprises a nucleic acid 
molecule which is a complement of the nucleotide sequence 

20 shown in SEQ ID NO:l, SEQ ID NO: 3, SEQ ID NO: 7, SEQ ID 
NO: 9, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 38, SEQ ID 

NO: 40, SEQ ID NO: 42, the cDNA of ATCC or the cDNA 

of ATCC or a portion thereof. A nucleic acid 

molecule which is complementary to a given nucleotide 

25 sequence is one which is sufficiently complementary to 
the given nucleotide sequence that it can hybridize to 
the given nucleotide sequence thereby forming a stable 
duplex. 

Moreover, the nucleic acid molecule of the 
30 invention can comprise only a portion of a nucleic acid 
sequence encoding CARD-3 or CARD-4, for example, a 
fragment which can be used as a probe or primer or a 
fragment encoding a biologically active portion of CARD-3 
or CARD-4. The nucleotide sequence determined from the 
35 cloning of the human CARD-3 or CARD-4, and the partial 
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murine CARD- 4 gene allows for the generation of probes 
and primers designed for use in identifying and/or 
cloning CARD-3 or CARD-4 homologues in other cell types, 
e.g., from other tissues, as well as CARD-3 or CARD-4 
5 homologues and orthologs from other mammals. The 

probe/primer typically comprises substantially purified 
oligonucleotide. The oligonucleotide typically comprises 
a region of nucleotide sequence that hybridizes under 
stringent conditions to at least about 12, preferably 

10 about 25, more preferably about 50, 75, 100, 125, 150, 

175, 200, 250, 300, 350 or 400 consecutive nucleotides of 
the sense or anti-sense sequence of SEQ ID NO : 1 , SEQ ID 

NO: 3, SEQ ID NO : 7 , SEQ ID NO : 9 , the cDNA of ATCC , 

the cDNA of ATCC or of a naturally occurring mutant 

15 of CEQ ID NO:l, SEQ ID NO : 3 , SEQ ID NO : 7 , SEQ ID NO : 9 , 

the cDNA of ATCC , or the cDNA of ATCC . 

Probes based on the human CARD-3 or human CARD-4 
or murine CARD-4 nucleotide sequence can be used to 
detect transcripts or genomic sequences encoding the same 

20 or identical proteins. The probe comprises a label group 
attached thereto, e.g., a radioisotope, a fluorescent 
compound, an enzyme, or an enzyme co- factor. Such probes 
can be used as a part of a diagnostic test kit for 
identifying allelic variants and orthologs of the CARD-3 

25 and CARD-4 proteins of the present invention, identifying 
cells or tissue which mis-express a CARD-3 or CARD-4 
protein, such as by measuring a level of a CARD-3 or 
CARD-4 -encoding nucleic acid in a sample of cells from a 
subject, e.g., detecting CARD-3 or CARD-4 mRNA levels or 

30 determining whether a genomic CARD-3 or CARD-4 gene has 
been mutated or deleted. 

A nucleic acid fragment encoding a 11 biologically 
active portion of CARD-3 or CARD-4L" can be prepared by 
isolating a portion of SEQ ID N0:1, SEQ ID NO: 3, SEQ ID 

35 NO: 7, SEQ ID NO : 9 , or the nucleotide sequence of the cDNA 
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of ATCC , or the nucleotide sequence of the cDNA of 

ATCC which encodes a polypeptide having a CARD- 3 or 

CARD-4 biological activity, expressing the encoded 
portion of CARD-3 or CARD-4 protein (e.g., by recombinant 
5 expression in vitro) and assessing the activity of the 
encoded portion of CARD-3 or CARD-4. For example, a 
nucleic acid fragment encoding a biologically active 
portion of CARD-3 or CARD-4 includes a CARD domain, e.g., 
SEQ ID NO: 6 and SEQ ID NO: 10 or SEQ ID NO: 26. 

10 The invention further encompasses nucleic acid 

molecules that differ from the nucleotide sequence of SEQ 
ID NO:l, SEQ ID NO : 3 , SEQ ID NO : 7 , SEQ ID NO : 9 , SEQ ID 
NO:25, SEQ ID NO:27, SEQ ID NO:38, SEQ ID NO:40, SEQ ID 
NO: 42, the cDNA of ATCC or the cDNA of ATCC 

15 due to degeneracy of the genetic code and thus encode the 
same CARD-3 or CARD-4 protein as that encoded by the 
nucleotide sequence shown in SEQ ID NO:l, SEQ ID NO : 3 , 
SEQ ID N0:7, SEQ ID NO : 9 , SEQ ID NO:25, SEQ ID NO:27, SEQ 
ID NO: 38, SEQ ID NO: 40, SEQ ID NO: 42, the cDNA of ATCC 

20 or the cDNA of ATCC . 

In addition to the human CARD-3 or CARD-4 
nucleotide sequence shown in SEQ ID NO:l, SEQ ID NO: 3, 
SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 25, SEQ ID NO: 27, SEQ 
ID NO: 38, SEQ ID NO: 40, the cDNA of ATCC , the cDNA 

25 of ATCC , or the cDNA of ATCC , and the murine 

CARD-4L cDNA sequence shown in SEQ ID NO: 42 it will be 
appreciated by those skilled in the art that DNA sequence 
polymorphisms that lead to changes in the amino acid 
sequences of CARD-3 or CARD-4 may exist within a 

30 population (e.g., the human population). Such genetic 

polymorphism in the CARD-3 or CARD-4 gene may exist among 
individuals within a population due to natural allelic 
variation. As used herein, the terms "gene" and 
"recombinant gene" refer to nucleic acid molecules 

35 comprising an open reading frame encoding a CARD-3 or 
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CARD-4 protein, preferably a mammalian CARD-3 or CARD-4 
protein. Such natural allelic variations can typically 
result in 1-5% variance in the nucleotide sequence of the 
CARD-3 or CARD-4 gene. Any and all such nucleotide 
5 variations and resulting amino acid polymorphisms in 
CARD-3 or CARD-4 that are the result of natural allelic 
variation and that do not alter the functional activity 
of CARD-3 or CARD-4 are intended to be within the scope 
of the invention. 

10 Moreover, nucleic acid molecules encoding CARD-3 

or CARD-4 proteins from other species (CARD-3 or CARD-4 
orthologs /homologues ) , which have a nucleotide sequence 
which differs from that of a human CARD-3 or CARD-4, are 
intended to be within the scope of the invention. For 

15 example, Example 5 describes the murine CARD-4 ortholog. 
Nucleic acid molecules corresponding to natural allelic 
variants and homologues of the CARD-3 or CARD-4 cDNA of 
the invention can be isolated based on their identity to 
the human CARD-3 or human or murine CARD-4 nucleic acids 

2 0 disclosed herein using the human or murine cDNAs, or a 

portion thereof, as a hybridization probe according to 
standard hybridization techniques under stringent 
hybridization conditions. In general, an allelic variant 
of a gene will be readily identifiable as mapping to the 
25 same chromosomal location as said gene. For example, in 
Example 6, the chromosomal location of the human CARD-4 
gene is discovered to be chromosome 7 close to the 
SHGC-3192 8 genetic marker. Allelic variants of human 
CARD-4 will be readily identifiable as mapping to the 

3 0 human CARD-4 locus on chromosome 7 near genetic marker 

SHGC-31928 . 

Accordingly, in another embodiment, an isolated 
nucleic acid molecule of the invention is at least 300 
(325, 350, 375, 400, 425, 450, 500, 550, 600, 650, 700, 
35 800, 900, 1000, 1300, 1600 or 1931) nucleotides in length 
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acid molecule comprising the nucleotide sequence, 
preferably the coding sequence, of SEQ ID NO:l, SEQ ID 

NO: 3, or the cDNA of ATCC . In yet another 

5 embodiment, an isolated nucleic acid molecule of the 

invention is at least 300 (325, 350, 375, 400, 425, 450, 
500, 550, 600, 650, 700, 800, 900, 1000, or 1300, 1640, 
1900, 2200, 2500, 2800, 3100, or 3382) nucleotides in 
length and hybridizes under stringent conditions to the 

10 nucleic acid molecule comprising the nucleotide sequence, 
preferably the coding sequence, of SEQ ID NO: 7, SEQ ID 
NO: 9, SEQ ID NO : 3 8 , SEQ ID NO: 40, or the cDNA of ATCC 

. Accordingly, in another embodiment, an isolated 

nucleic acid molecule of the invention is at least 300 

15 (325, 350, 375, 400, 425, 450, 500, 550, 600, 650, 700, 
800, 900, 1000, or 1300, 1640, 1900, 2200, 2500, 2800, 
3100, 3300, 3600, 3900, 4200 or 4209) nucleotides in 
length and hybridizes under stringent conditions to the 
nucleic acid molecule comprising the nucleotide sequence, 

20 preferably the coding sequence, of SEQ ID NO:42. 

As used herein, the term "hybridizes under 
stringent conditions" is intended to describe conditions 
for hybridization and washing under which nucleotide 
sequences at least 60% (65%, 70%, preferably 75%) 

25 identical to each other typically remain hybridized to 

each other . Such stringent conditions are known to those 
skilled in the art and can be found in Current Protocols 
in Molecular Biology, John Wiley & Sons, N.Y. (1989) , 
6.3.1-6.3.6. An, non- limiting example of stringent 

30 hybridization conditions are hybridization in 6X sodium 
chloride/sodium citrate (SSC) at about 45?C, followed by 
one or more washes in 0.2 X SSC, 0.1% SDS at 50-65?C. 
Preferably, an isolated nucleic acid molecule of the 
invention that hybridizes under stringent conditions to 

35 the sequence of SEQ ID N0:1, SEQ ID NO: 3, the cDNA of 
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ATCC corresponds to a naturally-occurring nucleic 

acid molecule. As used herein, a "naturally-occurring" 
nucleic acid molecule refers to an RNA or DNA molecule 
having a nucleotide sequence that occurs in nature (e.g., 
5 encodes a natural protein) . 

In addition to naturally-occurring allelic 
variants of the CARD- 3 or CARD-4 sequence that may exist 
in the population, the skilled artisan will further 
appreciate that changes can be introduced by mutation 

10 into the nucleotide sequence of SEQ ID NO:l, SEQ ID NO : 3 , 
SEQ ID NO: 7, SEQ ID NO : 9 , SEQ ID NO: 25, SEQ ID NO: 26, SEQ 
ID NO: 38, SEQ ID NO: 40, SEQ ID NO: 42, the cDNA of ATCC 

, the cDNA of ATCC , or the cDNA of ATCC 

, thereby leading to changes in the amino acid 

15 sequence of the encoded CARD-3, CARD-4L/S protein, CARD-4 
splice variant, or murine CARD-4 without altering the 
functional ability of the CARD- 3, CARD-4L./S, CARD-4 
splice variant, or murine CARD-4 protein. For example, 
one can make nucleotide substitutions leading to amino 

20 acid substitutions at "non-essential" amino acid 

residues. A "non-essential" amino acid residue is a 
residue that can be altered from the wild-type sequence 
of CARD -3 , CARD - 4 L / S , CARD-4 splice variant, or murine 
CARD-4 protein (e.g., the sequence of SEQ ID NO:2, SEQ ID 

25 NO: 8, SEQ ID NO: 26, SEQ ID NO: 39, SEQ ID NO: 41 and SEQ ID 
NO: 43) without altering the biological activity, whereas 
an "essential" amino acid residue is required for 
biological activity. For example, amino acid residues 
that are conserved among the CARD -3 , CARD-4L/S, CARD-4 

3 0 splice variant, or murine CARD-4 proteins of various 
species are predicted to be particularly unamenable to 
alteration . 

For example, preferred CARD- 3 or CARD-4 proteins 
of the present invention, contain at least one CARD 
3 5 domain. Additionally, a CARD -3 protein also contains at 
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least one kinase domain or at least one linker domain. A 
CARD domain contains at least one nucleotide binding 
domain or Leucine -rich repeats. Such conserved domains 
are less likely to be amenable to mutation. Other amino 
5 acid residues, however, (e.g., those that are not 

conserved or only semi -conserved among CARD- 3 or CARD- 4 
of various species) may not be essential for activity and 
thus are likely to be amenable to alteration. 

Accordingly, another aspect of the invention 

10 pertains to nucleic acid molecules encoding CARD- 3 or 
CARD-4 proteins that contain changes in amino acid 
residues that are not essential for activity. Such 
CARD-3 or CARD-4 proteins differ in amino acid sequence 
from SEQ ID NO: 2, SEQ ID NO: 8, SEQ ID NO: 25, SEQ ID 

15 NO: 39, SEQ ID NO : 4 1 , or SEQ ID NO: 43 and yet retain 
biological activity. In one embodiment, the isolated 
nucleic acid molecule includes a nucleotide sequence 
encoding a protein that includes an amino acid sequence 
that is at least about 45% identical, 65%, 75%, 85%, 95%, 

20 or 98% identical to the amino acid sequence of SEQ ID 
NO:2, SEQ ID NO : 8 , SEQ ID NO: 26, SEQ ID NO: 39, SEQ ID 
NO: 41 or SEQ ID NO: 43. 

An isolated nucleic acid molecule encoding a 
CARD-3 or CARD-4 proteins having a sequence which differs 

25 from that of SEQ ID NO:2, SEQ ID NO: 8, SEQ ID NO: 26, SEQ 
ID NO: 39, SEQ ID NO: 41 or SEQ ID NO: 43, can be created by 
introducing one or more nucleotide substitutions, 
additions or deletions into the nucleotide sequence of 
CARD-3 (SEQ ID NO : 1 , SEQ ID NO: 3, the cDNA of ATCC ) 

3 0 or CARD-4L (SEQ ID NO : 7 , SEQ ID NO : 9 , the cDNA of 

ATCC ), or CARD- 4 S (SEQ ID NO:25, SEQ ID NO:27, the 

cDNA of ATCC ) , or human CARD-4 splice variants (SEQ 

ID NO: 38, SEQ ID NO: 40, the cDNA of ATCC or the cDNA 

of ATCC ), or murine CARD-4 (SEQ ID NO: 42) such that 

35 one or more amino acid substitutions, additions or 
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deletions are introduced into the encoded protein. 
Mutations can be introduced by standard techniques, such 
as site-directed mutagenesis and PCR-mediated 
mutagenesis. Preferably, conservative amino acid 
substitutions are made at one or more predicted 
non-essential amino acid residues. A "conservative amino 
acid substitution" is one in which the amino acid residue 
is replaced with an amino acid residue having a similar 
side chain. Families of amino acid residues having 
similar side chains have been defined in the art. These 
families include amino acids with basic side chains 
(e.g., lysine, arginine, histidine) , acidic side chains 
(e.g., aspartic acid, glutamic acid), uncharged polar 
side chains (e.g., glycine, asparagine, glutamine, 
serine, threonine, tyrosine, cysteine) , nonpolar side 
chains (e.g., alanine, valine, leucine, isoleucine, 
proline, phenylalanine, methionine, tryptophan) , 
beta-branched side chains (e.g., threonine, valine, 
isoleucine) and aromatic side chains (e.g., tyrosine, 
phenylalanine, tryptophan, histidine) . Thus, a predicted 
nonessential amino acid residue in CARD- 3 or CARD-4 is 
preferably replaced with another amino acid residue from 
the same side chain family. Alternatively, mutations can 
be introduced randomly along all or part of a CARD- 3 or 
CARD-4 coding sequence, such as by saturation 
mutagenesis, and the resultant mutants can be screened 
for CARD-3 or CARD-4 biological activity to identify 
mutants that retain activity. Following mutagenesis, the 
encoded protein can be expressed recombinantly and the 
activity of the protein can be determined. 

In an embodiment, a mutant CARD-3 or CARD-4 
protein can be assayed for: (1) the ability to form 
protein:protein interactions with proteins in the 
apoptotic signalling pathway; (2) the ability to bind a 
CARD-3 or CARD-4 ligand; or (3) the ability to bind to an 
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intracellular target protein. For example, (1) in 
Example 7, a two-hybrid screening assay for the physical 
interaction of CARD-3 and CARD-4 is shown, (2) in Example 
8, a two-hybrid system assay for the interaction between 
5 CARD-4 and its ligand hNUDC is described, and (3) in 
Example 12, a coimmunoprecipitation assay for the 
interaction of CARD-3 with its ligand CARD-4 is shown. 
In yet another embodiment, a mutant CARD-3 or CARD-4 
protein can be assayed for the ability to modulate 

10 cellular proliferation, cellular differentiation, or 
cellular death. For example, in Example 10, assays for 
the regulation of cellular death (apoptosis) by CARD-3 or 
CARD-4 are described. In yet another embodiment, a 
mutant CARD-3 or CARD-4 protein can be assayed for 

15 regulation of a cellular signal transduction pathway. 

For example, in Example 9, an assay for the regulation by 
CARD-4 of the NF-kB pathway is described. 

The present invention encompasses antisense 
nucleic acid molecules, i.e., molecules which are 

2 0 complementary to a sense nucleic acid encoding a protein, 
e.g., complementary to the coding strand of a 
double -stranded cDNA molecule or complementary to an mRNA 
sequence. Accordingly, an antisense nucleic acid can 
hydrogen bond to a sense nucleic acid. The antisense 

2 5 nucleic acid can be complementary to an entire CARD-3 or 
CARD-4 coding strand, or to only a portion thereof, e.g. , 
all or part of the protein coding region (or open reading 
frame) . An antisense nucleic acid molecule can be 
antisense to a noncoding region of the coding strand of a 

30 nucleotide sequence encoding CARD-3 or CARD-4. The 

noncoding regions ("5' and 3' untranslated regions") are 
the 5 ' and 3 ' sequences which flank the coding region and 
are not translated into amino acids. 

Given the coding strand sequences encoding CARD-3 

35 or CARD-4 disclosed herein (e.g., SEQ ID N0:1, SEQ ID 
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NO: 3, SEQ ID NO : 7 , SEQ ID NO : 9 , SEQ ID NO: 25, SEQ ID 
NO:27, SEQ ID NO:38, SEQ ID NO : 4 0 , or SEQ ID NO:42), 
antisense nucleic acids of the invention can be designed 
according to the rules of Watson and Crick base pairing. 
5 The antisense nucleic acid molecule can be complementary 
to the entire coding region of CARD- 3 or CARD-4L/S mRNA, 
but more preferably is an oligonucleotide which is 
antisense to only a portion of the coding or noncoding 
region of CARD-3 or CARD-4 mRNA. For example, the 
10 antisense oligonucleotide can be complementary to the 
region surrounding the translation start site of CARD-3 
mRNA, e.g., an oligonucleotide having the sequence 
CCCTGGTACTTGCCCCTCCGGTAG (SEQ ID NO: 34) or 

CCTGGTACTTGCCCCTCC (SEQ ID NO: 35) or of the CARD- 4 L mRNA 

15 e.g., TCGTTAAGCCCTTGAAGACAGTG ( SEQ ID NO : 3 6 ) and 

TCGTTAGCCCTTGAAGACCAGTGAGTGTAG (SEQ ID NO: 37) . An 
antisense oligonucleotide can be, for example, about 5, 
10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in 
length. An antisense nucleic acid of the invention can 

20 be constructed using chemical synthesis and enzymatic 
ligation reactions using procedures known in the art. 
For example, an antisense nucleic acid (e.g., an 
antisense oligonucleotide) can be chemically synthesized 
using naturally occurring nucleotides or variously 

25 modified nucleotides designed to increase the biological 
stability of the molecules or to increase the physical 
stability of the duplex formed between the antisense and 
sense nucleic acids, e.g., phosphorothioate derivatives 
and acridine substituted nucleotides can be used. 

3 0 Examples of modified nucleotides which can be used to 
generate the antisense nucleic acid include 
5-f luorouracil , 5 -bromouracil , 5-chlorouracil , 
5 - iodouraci 1 , hypoxanthine , xanthine , 4 -acetylcytosine , 
5- (carboxyhydroxylmethyl) uracil, 

3 5 5 - carboxymethy laminomethyl - 2 - thiouridine , 
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5-carboxymethylaminomethyluracil , dihydrouracil , 
beta-D-galactosylqueosine , inosine , 

N6 - isopentenyladenine , 1 -methyl guanine , 1 -methyl inosine , 
2 , 2 -dimethylguanine , 2 -methyl adenine , 2 -methylguanine , 
5 3 -methyl cytosine, 5 -methyl cytosine , N6 -adenine, 
7 -methylguanine, 5-methylaminomethyluracil , 
5 - me t hoxy ami nome thyl-2-thi our a c i 1 , 

beta-D-mannosylqueosine , 5 ' -methoxycarboxymethyluracil , 
5 -methoxyuracil , 2 -methylthio-N6 - isopentenyladenine , 

10 uracil -5 -oxyacetic acid (v) , wybutoxosine , pseudouracil , 
queosine, 2 - thiocytosine , 5-methyl -2 - thiouracil , 
2 - thiouracil , 4 -thiouracil , 5-methyluracil , 
uracil -5 -oxyacetic acid methylester, uracil -5 -oxyacetic 
acid (v) , 5-methyl-2-thiouracil , 

15 3- (3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 

2 , 6-diaminopurine . Alternatively, the antisense nucleic 
acid can be produced biologically using an expression 
vector into which a nucleic acid has been subcloned in an 
antisense orientation (i.e., RNA transcribed from the 

20 inserted nucleic acid will be of an antisense orientation 
to a target nucleic acid of interest, described further 
in the following subsection) . 

The antisense nucleic acid molecules of the 
invention are typically administered to a subject or 

25 generated in situ such that they hybridize with or bind 
to cellular mRNA and/or genomic DNA encoding a CARD-3 or 
CARD- 4 protein to thereby inhibit expression of the 
protein, e.g., by inhibiting transcription and/or 
translation. The hybridization can be by conventional 

3 0 nucleotide complementarity to form a stable duplex, or, 
for example, in the case of an antisense nucleic acid 
molecule which binds to DNA duplexes, through specific 
interactions in the major groove of the double helix. An 
example of a route of administration of antisense nucleic 

35 acid molecules of the invention include direct injection 
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at a tissue site. Alternatively, antisense nucleic acid 
molecules can be modified to target selected cells and 
then administered systemically . For example, for 
systemic administration, antisense molecules can be 
5 modified such that they specifically bind to receptors or 
antigens expressed on a selected cell surface, e.g., by 
linking the antisense nucleic acid molecules to peptides 
or antibodies which bind to cell surface receptors or 
antigens. The antisense nucleic acid molecules can also 

10 be delivered to cells using the vectors described herein. 
To achieve sufficient intracellular concentrations of the 
antisense molecules, vector constructs in which the 
antisense nucleic acid molecule is placed under the 
control of a strong pol II or pol III promoter are 

15 preferred. 

An antisense nucleic acid molecule of the 
invention can be an a-anomeric nucleic acid molecule. An 
a-anomeric nucleic acid molecule forms specific 
double- stranded hybrids with complementary RNA in which, 

20 contrary to the usual fi-units, the strands run parallel 
to each other (Gaultier et al . (1987) Nucleic Acids. Res. 
15:6625-6641). The antisense nucleic acid molecule can 
also comprise a 2 ' -o-methylribonucleotide (Inoue et al . 
(1987) Nucleic Acids Res. 15:6131-6148) or a chimeric 

25 RNA-DNA analogue (Inoue et al . (1987) FEBS Lett. 
215 : 327-330) . 

The invention also encompasses ribozymes . 
Ribozymes are catalytic RNA molecules with ribonuclease 
activity which are capable of cleaving a single-stranded 

3 0 nucleic acid, such as an mRNA, to which they have a 

complementary region. Thus, ribozymes (e.g., hammerhead 
ribozymes (described in Haselhoff and Gerlach (1988) 
Nature 334:585-591)) can be used to catalytically cleave 
CARD- 3 or CARD-4 mRNA transcripts to thereby inhibit 

3 5 translation of CARD- 3 or CARD-4 mRNA. A ribozyme having 
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specificity for a CARD-3 or CARD- 4 -encoding nucleic acid 
can be designed based upon the nucleotide sequence of a 
CARD-3 or CARD-4 cDNA disclosed herein (e.g., SEQ ID 
NO:l, SEQ ID NO : 3 , SEQ ID NO : 7 , SEQ ID NO : 9 , SEQ ID 
5 NO: 25, SEQ ID NO: 27, SEQ ID NO: 38, SEQ ID NO: 40, and SEQ 
ID NO: 42) . For example, a derivative of a Tetrahymena 
L-19 IVS RNA can be constructed in which the nucleotide 
sequence of the active site is complementary to the 
nucleotide sequence to be cleaved in a CARD-3 or 

10 CARD-4 -encoding mRNA. See, e.g., Cech et al . U.S. Patent 
No. 4,987,071; and Cech et al . U.S. Patent No. 5,116,742. 
Alternatively, CARD-3 or CARD-4 mRNA can be used to 
select a catalytic RNA having a specific ribonuclease 
activity from a pool of RNA molecules. See, e.g., Bartel 

15 and Szostak (1993) Science 261:1411-1418. 

The invention also encompasses nucleic acid 
molecules which form triple helical structures. For 
example, CARD-3 or CARD-4 gene expression can be 
inhibited by targeting nucleotide sequences complementary 

20 to the regulatory region of the CARD-3 or CARD-4 (e.g., 
the CARD-3 or CARD-4 promoter and/or enhancers) to form 
triple helical structures that prevent transcription of 
the CARD-3 or CARD-4 gene in target cells. See 
generally, Helene (1991) Anticancer Drug Des. 

25 6(6) :569-84; Helene (1992) Ann. N.Y. Acad. Sci . 

660:27-36; and Maher (1992) Bioassays 14 (12 ): 807-15 . 

In embodiments, the nucleic acid molecules of the 
invention can be modified at the base moiety, sugar 
moiety or phosphate backbone to improve, e.g., the 

30 stability, hybridization, or solubility of the molecule. 
For example, the deoxyribose phosphate backbone of the 
nucleic acids can be modified to generate peptide nucleic 
acids (see Hyrup et al . (1996) Bioorganic & Medicinal 
Chemistry 4(1) : 5-23) . As used herein, the terms 

35 "peptide nucleic acids" or "PNAs" refer to nucleic acid 
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mimics, e.g., DNA mimics, in which the deoxyribose 
phosphate backbone is replaced by a pseudopept ide 
backbone and only the four natural nucleobases are 
retained. The neutral backbone of PNAs has been shown to 
5 allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. The synthesis of PNA 
oligomers can be performed using standard solid phase 
peptide synthesis protocols as described in Hyrup et al . 
(1996) supra; Perry -0' Keef e et al . (1996) Proc . Natl. 

10 Acad. Sci. USA 93: 14670-675. 

PNAs of CARD- 3 or CARD- 4 can be used for 
therapeutic and diagnostic applications. For example, 
PNAs can be used as antisense or antigene agents for 
sequence-specific modulation of gene expression by, e.g., 

15 inducing transcription or translation arrest or 

inhibiting replication. PNAs of CARD-3 or CARD-4 can 
also be used, e.g., in the analysis of single base pair 
mutations in a gene by, e.g., PNA directed PCR clamping; 
as 'artificial restriction enzymes when used in 

20 combination with other enzymes, e.g., SI nucleases (Hyrup 
(1996) supra; or as probes or primers for DNA sequence 
and hybridization (Hyrup (1996) supra; Perry-O' Keef e et 
al . (1996) Proc. Natl. Acad. Sci. USA 93: 14670-675). 

In another embodiment, PNAs of CARD-3 or CARD-4 

25 can be modified, e.g., to enhance their stability or 

cellular uptake, by attaching lipophilic or other helper 
groups to PNA, by the formation of PNA -DNA chimeras, or 
by the use of liposomes or other techniques of drug 
delivery known in the art. For example, PNA -DNA chimeras 

30 of CARD-3 or CARD-4 can be generated which may combine 
the advantageous properties of PNA and DNA. Such 
chimeras allow DNA recognition enzymes, e.g., RNAse H and 
DNA polymerases, to interact with the DNA portion while 
the PNA portion would provide high binding affinity and 

35 specificity. PNA -DNA chimeras can be linked using 
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linkers of appropriate lengths selected in terms of base 
stacking, number of bonds between the nucleobases, and 
orientation (Hyrup (1996) supra) . The synthesis of 
PNA-DNA chimeras can be performed as described in Hyrup 
5 (1996) supra and Finn et al . (1996) Nucleic Acids 

Research 24 (17) : 3357-63 . For example, a DNA chain can be 
synthesized on a solid support using standard 
phosphoramidite coupling chemistry and modified 
nucleoside analogs, e.g., 

10 5' - (4-methoxytrityl) amino- 5' -deoxy- thymidine 

phosphoramidite, can be used as a between the PNA and the 
5' end of DNA (Mag et al . (1989) Nucleic Acid Res. 
17:5973-88) . PNA monomers are then coupled in a stepwise 
manner to produce a chimeric molecule with a 5' PNA 

15 segment and a 3' DNA segment (Finn et al . (1996) Nucleic 
Acids Research 24 (17) : 3357-63) . Alternatively, chimeric 
molecules can be synthesized with a 5' DNA segment and a 
3' PNA segment (Pe terser et al . (1975) Bioorganic Med. 
Chem. Lett. 5:1119-11124) . 

2 0 In other embodiments, the oligonucleotide may 

include other appended groups such as peptides (e.g., for 
targeting host cell receptors in vivo) , or agents 
facilitating transport across the cell membrane (see, 
e.g., Letsinger et al . (1989) Proc . Natl. Acad. Sci. USA 

25 86:6553-6556; Lemaitre et al . (1987) Proc. Natl. Acad. 

Sci. USA 84:648-652; PCT Publication No. WO 88/09810) or 
the blood-brain barrier (see, e.g., PCT Publication No. 
W0 89/10134) . In addition, oligonucleotides can be 
modified with hybridization- triggered cleavage agents 

30 (see, e.g., Krol et al . (1988) Bio/Techniques 6:958-976) 
or intercalating agents (see, e.g., Zon (1988) Pharm. 
Res. 5:539-549). To this end, the oligonucleotide may be 
conjugated to another molecule, e.g., a peptide, 
hybridization triggered cross-linking agent, transport 

35 agent, hybridization- triggered cleavage agent, etc. 
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II. Isolated CARD- 3 or CARD -4 Proteins and An t i - CARD - 3 
or CARD-4 Antibodies. 

One aspect of the invention pertains to isolated 
CARD-3 or CARD-4 proteins, and biologically active 
5 portions thereof, as well as polypeptide fragments 

suitable for use as immunogens to raise anti-CARD-3 or 
CARD-4 antibodies. In one embodiment, native CARD-3 or 
CARD-4 proteins can be isolated from cells or tissue 
sources by an appropriate purification scheme using 

10 standard protein purification techniques. In another 
embodiment, CARD-3 or CARD-4 proteins are produced by 
recombinant DNA techniques. Alternative to recombinant 
expression, a CARD-3 or CARD-4 protein or polypeptide can 
be synthesized chemically using standard peptide 

15 synthesis techniques. 

An "isolated" or "purified" protein or 
biologically active portion thereof is substantially free 
of cellular material or other contaminating proteins from 
the cell or tissue source from which the CARD-3 or CARD-4 

2 0 protein is derived, or substantially free from chemical 

precursors or other chemicals when chemically 
synthesized. The language "substantially free of 
cellular material" includes preparations of CARD-3 or 
CARD-4 protein in which the protein is separated from 
25 cellular components of the cells from which it is 

isolated or recombinantly produced. Thus, CARD-3 or 
CARD-4 protein that is substantially free of cellular 
material includes preparations of CARD-3 or CARD-4 
protein having less than about 30%, 20%, 10%, or 5% (by 

3 0 dry weight) of non-CARD-3 or CARD-4 protein (also 

referred to herein as a "contaminating protein") . When 
the CARD-3 or CARD-4 protein or biologically active 
portion thereof is recombinantly produced, it is also 
preferably substantially free of culture medium, i.e., 
3 5 culture medium represents less than about 20%, 10%, or 5% 
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of the volume of the protein preparation. When CARD-3 or 
CARD-4 protein is produced by chemical synthesis, it is 
preferably substantially free of chemical precursors or 
other chemicals, i.e., it is separated from chemical 
5 precursors or other chemicals which are involved in the 
synthesis of the protein. Accordingly such preparations 
of CARD-3 or CARD-4 protein have less than about 3 0%, 
20%, 10%, 5% (by dry weight) of chemical precursors or 
non-CARD-3 or CARD-4 chemicals. 

10 Biologically active portions of a CARD-3 or CARD-4 

protein include peptides comprising amino acid sequences 
sufficiently identical to or derived from the amino acid 
sequence of the CARD-3 or CARD-4 protein (e.g., the amino 
acid sequence shown in SEQ ID NO: 2, SEQ ID NO: 8, SEQ ID 

15 NO:26, SEQ ID NO : 3 9 , SEQ ID NO:41 or SEQ ID NO:43), which 
include less amino acids than the full length CARD-3 or 
CARD-4 proteins, and exhibit at least one activity of a 
CARD-3 or CARD-4 protein. Typically, biologically active 
portions comprise a domain or motif with at least one 

20 activity of the CARD-3 or CARD-4 protein. A biologically 
active portion of a CARD-3 or CARD-4 protein can be a 
polypeptide which is, for example, 10, 25, 50, 100 or 
more amino acids in length. Preferred biologically 
active polypeptides include one or more identified CARD-3 

25 or CARD-4 structural domains, e.g., the CARD domain (SEQ 
ID NO: 6 or SEQ ID NO: 10 or SEQ ID NO: 27) . 

Moreover, other biologically active portions, in 
which other regions of the protein are deleted, can be 
prepared by recombinant techniques and evaluated for one 

3 0 or more of the functional activities of a native CARD-3 
or CARD-4 protein. 

CARD-3 or CARD-4 protein has the amino acid 
sequence shown of SEQ ID NO: 2, SEQ ID NO: 8, SEQ ID NO: 26, 
SEQ ID NO: 39, SEQ ID NO: 41 or SEQ ID NO: 43. Other useful 

35 CARD-3 or CARD-4 proteins are substantially identical to 
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SEQ ID NO: 2 or SEQ ID NO : 8 or SEQ ID NO: 26, SEQ ID NO : 3 9 
or SEQ ID NO: 41 or SEQ ID NO: 43 and retain the functional 
activity of the protein of SEQ ID NO : 2 or SEQ ID NO: 8 or 
SEQ ID NO: 26, SEQ ID NO: 39 or SEQ ID NO : 4 1 or SEQ ID 
NO: 43 yet differ in amino acid sequence due to natural 
allelic variation or mutagenesis. CARD-3 and CARD-4 are 
involved in activating caspases in the apoptotic pathway. 
For example, in Example 10, CARD-4 is shown to enhance 
caspase 9 activity. Accordingly, a useful CARD-3 or 
CARD-4 protein is a protein which includes an amino acid 
sequence at least about 45%, preferably 55%, 65%, 75%, 
85%, 95%, or 99% identical to the amino acid sequence of 
SEQ ID NO: 2 or SEQ ID NO : 8 or SEQ ID NO: 26, SEQ ID NO: 39 
or SEQ ID NO: 41 or SEQ ID NO: 43 and retains the 
functional activity of the CARD-3 or CARD-4 proteins of 
SEQ ID NO: 2 or SEQ ID NO : 8 or SEQ ID NO: 26, SEQ ID NO: 39 
or SEQ ID NO: 41 or SEQ ID NO: 43. In other instances, the 
CARD-3 or CARD-4 protein is a protein having an amino 
acid sequence 55%, 65%, 75%, 85%, 95%, or 98% identical 
0 to the CARD-3 or CARD - 4 L CARD domain (SEQ ID NO: 6, SEQ ID 
NO: 10 and SEQ ID NO: 27) . In an embodiment, the CARD-3 or 
CARD-4 protein retains a functional activity of the 
CARD-3 or CARD-4 protein of SEQ ID NO:2, SEQ ID NO : 8 or 
SEQ ID NO: 26, SEQ ID NO: 39 or SEQ ID NO: 41 or SEQ ID 
5 NO: 43. 

To determine the percent identity of two amino 
acid sequences or of two nucleic acids, the sequences are 
aligned for optimal comparison purposes (e.g., gaps can 
be introduced in the sequence of a first amino acid or 

0 nucleic acid sequence for optimal alignment with a second 
amino or nucleic acid sequence) . The amino acid residues 
or nucleotides at corresponding amino acid positions or 
nucleotide positions are then compared. When a position 
in the first sequence is occupied by the same amino acid 

5 residue or nucleotide as the corresponding position in 
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the second sequence, then the molecules are identical at 
that position. The percent identity between the two 
sequences is a function of the number of identical 
positions shared by the sequences (i.e., % identity = # 
5 of identical positions/total # of positions x 100) . 

The determination of percent homology between two 
sequences can be accomplished using a mathematical 
algorithm. A preferred, non-limiting example of a 
mathematical algorithm utilized for the comparison of two 

10 sequences is the algorithm of Karlin and Altschul (1990) 
Proc. Nat'l Acad. Sci . USA 87:2264-2268, modified as in 
Karlin and Altschul (1993) Proc. Nat'l Acad. Sci. USA 
90:5873-5877. Such an algorithm is incorporated into the 
NBLAST and XBLAST programs of Altschul, et al . (1990) J. 

15 Mol . Biol. 215:403-410, BLAST nucleotide searches can be 
performed with the NBLAST program, score = 100, 
wordlength = 12 to obtain nucleotide sequences similar or 
homologous to CARD- 3 or CARD-4 nucleic acid molecules of 
the invention. For example, Example 5 describes the use 

2 0 of the TBLiASTN program to query a database of sequences 
of full length and partial cDNA sequences with the human 
CARD-4 polypeptide sequence leading to the discovery of 
murine CARD-4 and Example 4 describes the use of BLASTN 
to query a proprietary EST database with the 5' 

25 untranslated sequence of CARD-4 leading to the discovery 
of two human CARD-4 splice variants. BLAST protein 
searches can be performed with the XBLAST program, score 
= 50, wordlength = 3 to obtain amino acid sequences 
homologous to CARD-3 or CARD-4 protein molecules of the 

30 invention. To obtain gapped alignments for comparison 
purposes, Gapped BLAST can be utilized as described in 
Altschul et al., (1997) Nucleic Acids Res. 25:3389-3402. 
When utilizing BLAST and Gapped BIxAST programs, the 
default parameters of the respective programs (e.g., 

35 XBLAST and NBLAST) can be used. See 
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http://www.ncbi.nlm.nih.gov. Another preferred, 
non-limiting example of a mathematical algorithm utilized 
for the comparison of sequences is the algorithm of Myers 
and Miller, CABIOS (1989) . Such an algorithm is 
incorporated into the ALIGN program (version 2.0) which 
is part of the GCG sequence alignment software package. 
When utilizing the ALIGN program for comparing amino acid 
sequences, a PAM12 0 weight residue table, a gap length 
penalty of 12, and a gap penalty of 4 can be used. 

The percent identity between two sequences can be 
determined using techniques similar to those described 
above, with or without allowing gaps. In calculating 
percent identity, only exact matches are counted. 

The invention also provides CARD- 3 or CARD- 4 
chimeric or fusion proteins. As used herein, a CARD-3 or 
CARD-4 "chimeric protein" or "fusion protein" comprises a 
CARD- 3 or CARD-4 polypeptide operatively linked to a 
non-CARD-3 or CARD-4 polypeptide. A "CARD-3 or CARD-4 
polypeptide" refers to a polypeptide having an amino acid 
sequence corresponding to CARD-3 or CARD- 4 L/ S , murine 
CARD-4 or human CARD-4 splice variants, whereas a 
"non-CARD-3 or CARD-4 polypeptide" refers to a 
polypeptide having an amino acid sequence corresponding 
to a protein which is not substantially identical to the 
CARD-3 or CARD-4L/S protein, murine CARD-4, or human 
CARD-4 splice variants e.g., a protein which is different 
from the CARD-3 or CARD-4 proteins and which is derived 
from the same or a different organism. Within a CARD-3 
or CARD - 4 L fusion protein, the CARD-3 or CARD-4 
polypeptide can correspond to all or a portion of a 
CARD-3 or CARD-4 protein, preferably at least one 
biologically active portion of a CARD-3 or CARD-4 
protein. Within the fusion protein, the term 
"operatively linked" is intended to indicate that the 
CARD-3 or CARD-4 polypeptide and the non-CARD-3 or 
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non-CARD-4 polypeptide are fused in- frame to each other. 
The non-CARD-3 or non-CARD-4 polypeptide can be fused to 
the N-terminus or C-terminus of the CARD- 3 or CARD -4 
polypeptide . 

5 One useful fusion protein is a GST-CARD-3 or 

GST-CARD-4 fusion protein in which the CARD- 3 or CARD-4 
sequences are fused to the C-terminus of the GST 
sequences. Such fusion proteins can facilitate the 
purification of recombinant CARD-3 or CARD-4. 

10 In another embodiment, the fusion protein contains 

a signal sequence from another protein. In certain host 
cells (e.g., mammalian host cells), expression and/or 
secretion of CARD-3 or CARD-4 can be increased through 
use of a heterologous signal sequence. For example, the 

15 gp67 secretory sequence of the baculovirus envelope 
protein can be used as a heterologous signal sequence 
(Current Protocols in Molecular Biology, Ausubel et al . , 
eds . , John Wiley & Sons, 1992). Other examples of 
eukaryotic heterologous signal sequences include the 

20 secretory sequences of melittin and human placental 

alkaline phosphatase (Stratagene; La Jolla, California) . 
In yet another example, useful prokaryotic heterologous 
signal sequences include the phoA secretory signal 
(Molecular cloning, Sambrook et al, second edition, Cold 

25 spring harbor laboratory press, 198 9) and the protein A 
secretory signal (Pharmacia Biotech; Piscataway, New 
Jersey) . 

In yet another embodiment, the fusion protein is a 
CARD-3 or CARD-4 -immunoglobulin fusion protein in which 

30 all or part of CARD-3 or CARD-4 is fused to sequences 
derived from a member of the immunoglobulin protein 
family. The CARD-3 or CARD-4 -immunoglobulin fusion 
proteins of the invention can be incorporated into 
pharmaceutical compositions and administered to a subject 

35 to inhibit an interaction between a CARD-3 or CARD-4 
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ligand and a CARD-3 or CARD-4 protein on the surface of a 
cell, to thereby suppress CARD-3 or CARD- 4 -mediated 
signal transduction in vivo. The CARD-3 or 
CARD- 4 -immunoglobulin fusion proteins can be used to 
5 affect the bioavailability of a CARD-3 or CARD-4 cognate 
ligand. Inhibition of the CARD-3 ligand/CARD-3 or CARD-4 
ligand/CARD-4 interaction may be useful therapeutically 
for both the treatment of proliferative and 
dif f erent iative disorders, as well as modulating (e.g. 

10 promoting or inhibiting) cell survival. Moreover, the 
CARD-3 or CARD- 4 - immunoglobulin fusion proteins of the 
invention can be used as immunogens to produce 
anti-CARD-3 or CARD-4 antibodies in a subject, to purify 
CARD-3 or CARD-4 ligands and in screening assays to 

15 identify molecules which inhibit the interaction of 
CARD-3 or CARD-4 with a CARD-3 or CARD-4 ligand. 

Preferably, a CARD-3 or CARD-4 chimeric or fusion 
protein of the invention is produced by standard 
recombinant DNA techniques. For example, DNA fragments 

20 coding for the different polypeptide sequences are 

ligated together in- frame in accordance with conventional 
techniques, for example by employing blunt -ended or 
stagger-ended termini for ligation, restriction enzyme 
digestion to provide for appropriate termini, filling- in 

25 of cohesive ends as appropriate, alkaline phosphatase 
treatment to avoid undesirable joining, and enzymatic 
ligation. In another embodiment, the fusion gene can be 
synthesized by conventional techniques including 
automated DNA synthesizers. Alternatively, PCR 

30 amplification of gene fragments can be carried out using 
anchor primers which give rise to complementary overhangs 
between two consecutive gene fragments which can 
subsequently be annealed and reamplified to generate a 
chimeric gene sequence (see, e.g., Current Protocols in 

35 Molecular Biology, Ausubel et al . eds . , John Wiley & 
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Sons: 1992). Moreover, many expression vectors are 
commercially available that already encode a fusion 
moiety (e.g., a GST polypeptide). A CARD-3 or 
CARD- 4 -encoding nucleic acid can be cloned into such an 
5 expression vector such that the fusion moiety is linked 
in- frame to the CARD-3 or CARD-4 protein. 

The present invention also pertains to variants of 
the CARD-3 or CARD-4 proteins which function as either 
CARD-3 or CARD-4 agonists (mimetics) or as CARD-3 or 

10 CARD-4 antagonists. Variants of the CARD-3 or CARD-4 
protein can be generated by mutagenesis, e.g., discrete 
point mutation or truncation of the CARD-3 or CARD-4 
protein. An agonist of the CARD-3 or CARD-4 protein can 
retain substantially the same, or a subset, of the 

15 biological activities of the naturally occurring form of 
the CARD-3 or CARD-4 protein. An antagonist of the 
CARD-3 or CARD-4 protein can inhibit one or more of the 
activities of the naturally occurring form of the CARD-3 
or CARD-4 protein by, for example, competitively binding 

20 to a downstream or upstream member of a cellular 

signaling cascade which includes the CARD-3 or CARD-4 
protein. Thus, specific biological effects can be 
elicited by treatment with a variant of limited function. 
Treatment of a subject with a variant having a subset of 

25 the biological activities of the naturally occurring form 
of the protein can have fewer side effects in a subject 
relative to treatment with the naturally occurring form 
of the CARD-3 or CARD-4 proteins. 

Variants of the CARD-3 or CARD-4 protein which 

3 0 function as either CARD-3 or CARD-4 agonists (mimetics) 
or as CARD-3 or CARD-4 antagonists can be identified by 
screening combinatorial libraries of mutants, e.g., 
truncation mutants of the CARD-3 or CARD-4 protein for 
CARD-3 or CARD-4 protein agonist or antagonist activity. 

35 In one embodiment, a variegated library of CARD-3 or 
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CARD-4 variants is generated by combinatorial mutagenesis 
at the nucleic acid level and is encoded by a variegated 
gene library. A variegated library of CARD- 3 or CARD-4 
variants can be produced by, for example, enzymat ically 
5 ligating a mixture of synthetic oligonucleotides into 
gene sequences such that a degenerate set of potential 
CARD- 3 or CARD-4 sequences is expressible as individual 
polypeptides, or alternatively, as a set of larger fusion 
proteins (e.g., for phage display) containing the set of 

10 CARD- 3 or CARD-4 sequences therein. There are a variety 
of methods which can be used to produce libraries of 
potential CARD-3 or CARD-4 variants from a degenerate 
oligonucleotide sequence. Chemical synthesis of a ^ 
degenerate gene sequence can be performed in an automatic 

15 DNA synthesizer, and the synthetic gene then ligated into 
an appropriate expression vector. Use of a degenerate 
set of genes allows for the provision, in one mixture, of 
all of the sequences encoding the desired set of 
potential CARD-3 or CARD-4 sequences. Methods for 

2 0 synthesizing degenerate oligonucleotides are known in the 
art (see, e.g., Narang (1983) Tetrahedron 39:3; Itakura 
et al . (1984) Annu. Rev. Biochem. 53:323; Itakura et al . 
(1984) Science 198:1056; Ike et al . (1983) Nucleic Acid 
Res. 11:477) . 

2 5 Useful fragments of CARD-3 and CARD-4 include 

fragments comprising or consisting of a domain or 
subdomain described herein, e.g., a kinase domain or a 
CARD domain. 

In addition, libraries of fragments of the CARD-3 

3 0 or CARD-4 protein coding sequence can be used to generate 

a variegated population of CARD-3 or CARD-4 fragments for 
screening and subsequent selection of variants of a 
CARD-3 or CARD-4 protein. In one embodiment, a library 
of coding sequence fragments can be generated by treating 
35 a double stranded PCR fragment of a CARD-3 or CARD-4 
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coding sequence with a nuclease under conditions wherein 
nicking occurs only about once per molecule, denaturing 
the double stranded DNA, renaturing the DNA to form 
double stranded DNA which can include sense/ant isense 
5 pairs from different nicked products, removing single 
stranded portions from reformed duplexes by treatment 
with SI nuclease, and ligating the resulting fragment 
library into an expression vector. By this method, an 
expression library can be derived which encodes 

10 N-terminal and internal fragments of various sizes of the 
CARD- 3 or CARD -4 protein. 

Several techniques are known in the art for 
screening gene products of combinatorial libraries made 
by point mutations or truncation, and for screening cDNA 

15 libraries for gene products having a selected property. 
Such techniques are adaptable for rapid screening of the 
gene libraries generated by the combinatorial mutagenesis 
of CARD-3 or CARD-4 proteins. The most widely used 
techniques, which are amenable to high through-put 

20 analysis, for screening large gene libraries typically 
include cloning the gene library into replicable 
expression vectors, transforming appropriate cells with 
the resulting library of vectors, and expressing the 
combinatorial genes under conditions in which detection 

25 of a desired activity facilitates isolation of the vector 
encoding the gene whose product was detected. Recursive 
ensemble mutagenesis (REM) , a technique which enhances 
the frequency of functional mutants in the libraries, can 
be used in combination with the screening assays to 

3 0 identify CARD-3 or CARD-4 variants (Arkin and Yourvan 

(1992) Proc. Natl. Acad. Sci . USA 89:7811-7815; Delgrave 
et al . (1993) Protein Engineering 6 (3 ): 327-331) . 

An isolated CARD-3 or CARD-4 protein, or a portion 
or fragment thereof , can be used as an immunogen to 

35 generate antibodies that bind CARD-3 or CARD-4 using 
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standard techniques for polyclonal and monoclonal 
antibody preparation. The full-length CARD- 3 or CARD-4 
protein can be used or, alternatively, the invention 
provides antigenic peptide fragments of CARD-3 or CARD-4 
5 for use as immunogens . The antigenic peptide of CARD-3 or 
CARD-4 comprises at least 8 (preferably 10, 15, 20, or 
30) amino acid residues of the amino acid sequence shown 
in SEQ ID NO : 2 , SEQ ID NO : 8 or SEQ ID NO: 26, or SEQ ID 
NO: 3 9 or SEQ ID NO : 4 1 or SEQ ID NO: 43 or polypeptides 

10 including amino acids 128-139 or 287-298 of human CARD - 4 L 
and encompasses an epitope of CARD-3 or CARD-4 such that 
an antibody raised against the peptide forms a specific 
immune complex with CARD-3 or CARD-4. 

Useful antibodies include antibodies which bind to 

15 a domain or subdomain of CARD-3 or CARD-4 described 
hherein, e.g., a kinase domain or a CARD domain) . 

Preferred epitopes encompassed by the antigenic 
peptide are regions of CARD-3 or CARD-4 that are located 
on the surface of the protein, e.g., hydrophilic regions. 

20 Other important criteria include a preference for a 
terminal sequence, high antigenic index (e.g., as 
predicted by Jameson-Wolf algorithm) , ease of peptide 
synthesis (e.g., avoidance of prolines); and high surface 
probability (e.g., as predicted by the Emini algorithm; 

25 Figure 8 and Figure 9) . 

A CARD-3 or CARD-4 immunogen typically is used to 
prepare antibodies by immunizing a suitable subject, 
(e.g., rabbit, goat, mouse or other mammal) with the 
immunogen. An appropriate immunogenic preparation can 

30 contain, for example, recombinantly expressed CARD-3 or 
CARD-4 protein or a chemically synthesized CARD-3 or 
CARD-4 polypeptide. The preparation can further include 
an adjuvant, such as Freund's complete or incomplete 
adjuvant, or similar immunostimulatory agent. 

35 Immunization of a suitable subject with an immunogenic 
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CARD-3 or CARD-4 preparation induces a polyclonal 
anti-CARD-3 or CARD-4 antibody response. For example, 
polypeptides including amino acids 128-139 or 287-298 of 
human CARD - 4 L were conjugated to KLH and the resulting 
5 conjugates were used to immunize rabbits and polyclonal 
antibodies that specifically recognize the two immunogen 
peptides were generated. 

Accordingly, another aspect of the invention 
pertains to anti-CARD-3 or CARD-4 antibodies. The term 

10 "antibody" as used herein refers to immunoglobulin 
molecules and immunologically active portions of 
immunoglobulin molecules, i.e., molecules that contain an 
antigen binding site which specifically binds an antigen, 
such as CARD-3 or CARD-4. A molecule which specifically 

15 binds to CARD-3 or CARD-4 is a molecule which binds 

CARD-3 or CARD-4, but does not substantially bind other 
molecules in a sample, e.g., a biological sample, which 
naturally contains CARD-3 or CARD-4. Examples of 
immunologically active portions of immunoglobulin 

20 molecules include F(ab) and F(ab')2 fragments which can 
be generated by treating the antibody with an enzyme such 
as pepsin. The invention provides polyclonal and 
monoclonal antibodies that bind CARD-3 or CARD-4. The 
term "monoclonal antibody" or "monoclonal antibody 

25 composition", as used herein, refers to a population of 
antibody molecules that contain only one species of an 
antigen binding site capable of immunoreacting with a 
particular epitope of CARD-3 or CARD-4. A monoclonal 
antibody composition thus typically displays a single 

30 binding affinity for a particular CARD-3 or CARD-4 
protein with which it immunoreacts . 

Polyclonal anti-CARD-3 or CARD-4 antibodies can be 
prepared as described above by immunizing a suitable 
subject with a CARD-3 or CARD-4 immunogen. The 

35 anti-CARD-3 or CARD-4 antibody titer in the immunized 
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subject can be monitored over time by standard 
techniques, such as with an enzyme linked immunosorbent 
assay (ELISA) using immobilized CARD-3 or CARD-4. If 
desired, the antibody molecules directed against CARD-3 
5 or CARD-4 can be isolated from the mammal (e.g., from the 
blood) and further purified by well-known techniques, 
such as protein A chromatography to obtain the IgG 
fraction. At an appropriate time after immunization, 
e.g., when the anti-CARD-3 or CARD-4 antibody titers are 

10 highest, antibody-producing cells can be obtained from 
the subject and used to prepare monoclonal antibodies by 
standard techniques, such as the hybridoma technique 
originally described by Kohler and Milstein (1975) Nature 
256:495-497, the human B cell hybridoma technique (Kozbor 

15 et al . (1983) Immunol Today 4:72), the EBV-hybridoma 

technique (Cole et al . (1985), Monoclonal Antibodies and 
Cancer Therapy, Alan R. Liss, Inc., pp. 77-96) or trioma 
techniques. The technology for producing various 
antibodies monoclonal antibody hybridomas is well known 

20 (see generally Current Protocols in Immunology (1994) 

Coligan et al . (eds.) John Wiley & Sons, Inc., New York, 
NY) . Briefly, an immortal cell line (typically a 
myeloma) is fused to lymphocytes (typically splenocytes) 
from a mammal immunized with a CARD-3 or CARD-4 immunogen 

25 as described above, and the culture supernatants of the 
resulting hybridoma cells are screened to identify a 
hybridoma producing a monoclonal antibody that binds 
CARD-3 or CARD-4 . 

Any of the many well known protocols used for 

30 fusing lymphocytes and immortalized cell lines can be 
applied for the purpose of generating an anti-CARD-3 or 
CARD-4 monoclonal antibody (see, e.g., Current Protocols 
in Immunology, supra; Galfre et al . (1977) Nature 
266:55052; R.H. Kenneth, in Monoclonal Antibodies: A New 

35 Dimension In Biological Analyses, Plenum Publishing 
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Corp., New York, New York (1980); and Lemer (1981) Yale 
J. Biol. Med., 54:387-402. Moreover, the ordinarily 
skilled worker will appreciate that there are many 
variations of such methods which also would be useful. 
5 Typically, the immortal cell line (e.g., a myeloma cell 
line) is derived from the same mammalian species as the 
lymphocytes. For example, murine hybridomas can be made 
by fusing lymphocytes from a mouse immunized with an 
immunogenic preparation of the present invention with an 

10 immortalized mouse cell line, e.g., a myeloma cell line 
that is sensitive to culture medium containing 
hypoxanthine, aminopterin and thymidine ( "HAT medium"). 
Any of a number of myeloma cell lines can be used as a 
fusion partner according to standard techniques, e.g., 

15 the P3-NSl/l-Ag4-l, P3 -x63 -Ag8 . 653 or Sp2/0-Agl4 myeloma 
lines. These myeloma lines are available from ATCC. 
Typically, HAT-sensitive mouse myeloma cells are fused to 
mouse splenocytes using polyethylene glycol ("PEG"). 
Hybridoma cells resulting from the fusion are then 

20 selected using HAT medium, which kills unfused and 

unproductively fused myeloma cells (unfused splenocytes 
die after several days because they are not transformed) . 
Hybridoma cells producing a monoclonal antibody of the 
invention are detected by screening the hybridoma culture 

25 supernatants for antibodies that bind CARD- 3 or CARD-4, 
e.g., using a standard EL ISA assay. 

Alternative to preparing monoclonal 
antibody- secreting hybridomas, a monoclonal anti-CARD-3 
or CARD-4 antibody can be identified and isolated by 

3 0 screening a recombinant combinatorial immunoglobulin 
library (e.g., an antibody phage display library) with 
CARD-3 or CARD-4 to thereby isolate immunoglobulin 
library members that bind CARD-3 or CARD-4. Kits for 
generating and screening phage display libraries are 

35 commercially available (e.g., the Pharmacia Recombinant 
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Phage Antibody System, Catalog No. 27-9400-01; and the 
Stratagene Surf ZAP Phage Display Kit, Catalog No. 
240612) . Additionally, examples of methods and reagents 
particularly amenable for use in generating and screening 
5 antibody display library can be found in, for example, 
U.S. Patent No. 5,2 23,4 09; PCT Publication No. WO 
92/18619; PCT Publication No. WO 91/17271; PCT 
Publication No. WO 92/20791; PCT Publication No. WO 
92/15679; PCT Publication No. WO 93/01288; PCT 

10 Publication No. WO 92/01047; PCT Publication No. WO 

92/09690; PCT Publication No. WO 90/02809; Fuchs et al . 
(1991) Bio/Technology 9:1370-1372; Hay et al . (1992) Hum. 
Antibod. Hybridomas 3:81-85; Huse et al . (1989) Science 
246:1275-1281; Griffiths et al . (1993) EMBO J 12:725-734. 

15 Additionally, recombinant anti-CARD-3 or CARD-4 

antibodies, such as chimeric and humanized monoclonal 
antibodies, comprising both human and non-human portions, 
which can be made using standard recombinant DNA 
techniques, are within the scope of the invention. Such 

2 0 chimeric and humanized monoclonal antibodies can be 

produced by recombinant DNA techniques known in the art, 
for example using methods described in PCT Publication 
No. WO 87/02671; European Patent Application 184,187; 
European Patent Application 171,496; European Patent 

25 Application 173,494; PCT Publication No. WO 86/01533; 
U.S. Patent No. 4,816,567; European Patent Application 
125,023; Better et al . (1988) Science 240:1041-1043; Liu 
et al. (1987) Proc . Natl. Acad. Sci . USA 84:3439-3443; 
Liu et al . (1987) J. Immunol. 139:3521-3526; Sun et al . 

30 (1987) Proc. Natl. Acad. Sci. USA 84:214-218; Nishimura 
et al. (1987) Cane. Res. 47:999-1005; Wood et al . (1985) 
Nature 314:446-449; and Shaw et al . (1988) J. Natl. 
Cancer Inst. 8 0:1553-1559); Morrison, (1985) Science 
229:1202-1207; Oi et al . (1986) Bio/Techniques 4:214; 

35 U.S. Patent 5,225,539; Jones et al . (1986) Nature 
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321:552-525; Verhoeyan et al . (1988) Science 239:1534; 
and Beidler et al . (1988) J. Immunol. 141:4053-4060. 

An anti-CARD-3 or CARD -4 antibody (e.g., 
monoclonal antibody) can be used to isolate CARD- 3 or 
CARD- 4 by standard techniques, such as affinity 
chromatography or immunoprecipitation. An ^nti-CARD-3 or 
CARD-4 antibody can facilitate the purification of 
natural CARD- 3 or CARD-4 from cells and of recombinant ly 
produced CARD-3 or CARD-4 expressed in host cells. 
Moreover, an anti-CARD-3 or CARD-4 antibody can be used 
to detect CARD-3 or CARD-4 protein (e.g., in a cellular 
lysate or cell supernatant) in order to evaluate the 
abundance and pattern of expression of the CARD-3 or 
CARD-4 protein. Anti-CARD-3 or CARD-4 antibodies can be 
used diagnostically to monitor protein levels in tissue 
as part of a clinical testing procedure, e.g., to, for 
example, determine the efficacy of a given treatment 
regimen. Detection can be facilitated by coupling the 
antibody to a detectable substance. Examples of 
detectable substances include various enzymes, prosthetic 
groups, fluorescent materials, luminescent materials, 
bioluminescent materials, and radioactive materials. 
Examples of suitable enzymes include horseradish 
peroxidase, alkaline phosphatase, S-galactosidase , or 
acetylcholinesterase; examples of suitable prosthetic 
group complexes include streptavidin/biotin and 
avidin/biotin; examples of suitable fluorescent materials 
include umbellif erone, fluorescein, fluorescein 
isothiocyanate , rhodamine , dichlorotriazinylamine 
fluorescein, dansyl chloride or phycoerythrin; an example 
of a luminescent material includes luminol; examples of 
bioluminescent materials include lucif erase, luciferin, 
and aequorin, and examples of suitable radioactive 
material include 125 I, 131 I, 35 S or 3 H. 
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III. Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to 
vectors, preferably expression vectors, containing a 
nucleic acid encoding CARD-3 or CARD-4 (or a portion 
5 thereof) . As used herein, the term "vector" refers to a 
nucleic acid molecule capable of transporting another 
nucleic acid to which it has been linked. One type of 
vector is a "plasmid" , which refers to a circular double 
stranded DNA loop into which additional DNA segments can 

10 be ligated. Another type of vector is a viral vector, 
wherein additional DNA segments can be ligated into the 
viral genome. Certain vectors are capable of autonomous 
replication in a host cell into which they are introduced 
(e.g., bacterial vectors having a bacterial origin of 

15 replication and episomal mammalian vectors) . Other 
vectors (e.g., non-episomal mammalian vectors) are 
integrated into the genome of a host cell upon 
introduction into the host cell, and thereby are 
replicated along with the host genome. Moreover, certain 

2 0 vectors, expression vectors, are capable of directing the 
expression of genes to which they are operatively linked. 
In general, expression vectors of utility in recombinant 
DNA techniques are often in the form of plasmids 
(vectors) . However, the invention is intended to include 

25 such other forms of expression vectors, such as viral 
vectors (e.g., replication defective retroviruses, 
adenoviruses and adeno-associated viruses) , which serve 
equivalent f unct ions . 

The recombinant expression vectors of the 

30 invention comprise a nucleic acid of the invention in a 
form suitable for expression of the nucleic acid in a 
host cell, which means that the recombinant expression 
vectors include one or more regulatory sequences, 
selected on the basis of the host cells to be used for 

35 expression, which is operatively linked to the nucleic 
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acid sequence to be expressed. Within a recombinant 
expression vector, "operably linked" is intended to mean 
that the nucleotide sequence of interest is linked to the 
regulatory sequence (s) in a manner which allows for 
5 expression of the nucleotide sequence (e.g., in an in 
vitro transcription/translation system or in a host cell 
when the vector is introduced into the host cell) . The 
term "regulatory sequence" is intended to include 
promoters, enhancers and other expression control 

10 elements (e.g., polyadenylation signals). Such 

regulatory sequences are described, for example, in 
Goeddel; Gene Expression Technology: Methods in 
Enzymology 185, Academic Press, San Diego, CA (1990) . 
Regulatory sequences include those which direct 

15 constitutive expression of a nucleotide sequence in many 
types of host cell and those which direct expression of 
the nucleotide sequence only in certain host cells (e.g., 
tissue-specific regulatory sequences) . It will be 
appreciated by those skilled in the art that the design 

2 0 of the expression vector can depend on such factors as 
the choice of the host cell to be transformed, the level 
of expression of protein desired, etc. The expression 
vectors of the invention can be introduced into host 
cells to thereby produce proteins or peptides, including 

2 5 fusion proteins or peptides, encoded by nucleic acids as 

described herein (e.g., CARD-3 or CARD-4 proteins, mutant 
forms of CARD-3 or CARD-4, fusion proteins, etc.). 

The recombinant expression vectors of the 
invention can be designed for expression of CARD-3 or 

3 0 CARD-4 in prokaryotic or eukaryotic cells, e.g., 

bacterial cells such as E. coli, insect cells (using 
baculovirus expression vectors) yeast cells or mammalian 
cells. Suitable host cells are discussed further in 
Goeddel, Gene Expression Technology: Methods in 
35 Enzymology 185, Academic Press, San Diego, CA (1990) . 
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Alternatively, the recombinant expression vector can be 
transcribed and translated in vitro, for example using T7 
promoter regulatory sequences and T7 polymerase . 

Expression of proteins in prokaryotes is most 
5 often carried out in E. coli with vectors containing 
constitutive or inducible promoters directing the 
expression of either fusion or non-fusion proteins. 
Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the 

10 recombinant protein. Such fusion vectors typically serve 
three purposes: 1) to increase expression of recombinant 
protein; 2) to increase the solubility of the recombinant 
protein; and 3) to aid in the purification of the 
recombinant protein by acting as a ligand in affinity 

15 purification. Often, in fusion expression vectors, a 
proteolytic cleavage site is introduced at the junction 
of the fusion moiety and the recombinant protein to 
enable separation of the recombinant protein from the 
fusion moiety subsequent to purification of the fusion 

2 0 protein. Such enzymes, and their cognate recognition 

sequences, include Factor Xa, thrombin and enterokinase . 
Typical fusion expression vectors include pGEX (Pharmacia 
Biotech Inc; Smith and Johnson (1988) Gene 67:31-40), 
pMAL (New England Biolabs, Beverly, MA) and pRITS 

25 (Pharmacia, Piscataway, NJ) which fuse glutathione 
S- transferase (GST) , maltose E binding protein, or 
protein A, respectively, to the target recombinant 
protein. 

Examples of suitable inducible non-fusion E. coli 
30 expression vectors include pTrc (Amann et al . , (1988) 
Gene 69:301-315) and pET lid (Studier et al . , Gene 
Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, California (1990) 60-89) . 
Target gene expression from the pTrc vector relies on 
35 host RNA polymerase transcription from a hybrid trp-lac 
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fusion promoter. Target gene expression from the pET lid 
vector relies on transcription from a T7 gnlO-lac fusion 
promoter mediated by a coexpressed viral RNA polymerase 
(T7 gnl) . This viral polymerase is supplied by host 
5 strains BL21(DE3) or HMS174 (DE3) from a resident ? 
prophage harboring a T7 gnl gene under the 
transcriptional control of the lacUV5 promoter. 

One strategy to maximize recombinant protein 
expression in E. coli is to express the protein in a host 

10 bacteria with an impaired capacity to proteolytically 
cleave the recombinant protein (Gottesman, Gene 
Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, California (1990) 119-128) . 
Another strategy is to alter the nucleic acid sequence of 

15 the nucleic acid to be inserted into an expression vector 
so that the individual codons for each amino acid are 
those preferentially utilized in E. coli (Wada et al . 
(1992) Nucleic Acids Res. 20 : 2111-2118) v Such alteration 
of nucleic acid sequences of the invention can be carried 

20 out by standard DNA synthesis techniques. 

In another embodiment, the CARD- 3 or CARD -4 
expression vector is a yeast expression vector. Examples 
of vectors for expression in yeast S. cerivisae include 
pYepSecl (Baldari et al . (1987) EMBO J. 6:229-234), pMFa 

25 (Kurjan and Herskowitz, (1982) Cell 30:933-943), pJRY88 
(Schultz et al. (1987) Gene 54:113-123), pYES2 
(Invitrogen Corporation, San Diego, CA) , pGBT9 (Clontech, 
Palo Alto, CA) , pGADIO (Clontech, Palo Alto, CA) , pYADE4 
and pYGAE2 and pYPGE2 (Brunelli and Pall, (1993) Yeast 

30 9:1299-1308), pYPGE15 (Brunelli and Pall, (1993) Yeast 

9:1309-1318), pACTII (Dr. S.E. Elledge, Baylor College of 
Medicine) , and picZ (InVitrogen Corp, San Diego, CA) . 
For example, in Example 7 the expression of a fusion 
protein comprising amino acids 1-145 of human CARD-4L 

35 fused to the DNA-binding domain of S. cerevisiae 
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transcription factor GAL4 from the yeast expression 
vector pGBT9 is described. In another example, Example 8 
describes the expression of a fusion protein comprising 
amino acids 406-953 of human CARD - 4 L fused to the 
5 DNA-binding domain of S. cerevisiae transcription factor 
GAL 4 from the yeast expression vector pGBT9 . In yet 
another example, Example 7 describes the expression of a 
fusion protein comprising CARD-3 fused to the 
transcriptional activation domain of S. cerevisiae 

10 transcription factor GAL4 from the yeast expression 
vector pACTII . 

Alternatively, CARD-3 or CARD-4 can be expressed 
in insect cells using baculovirus expression vectors. 
Baculovirus vectors available for expression of proteins 

15 in cultured insect cells (e.g., Sf 9 cells) include the 
pAc series (Smith et al . (1983) Mol . Cell Biol. 
3:2156-2165) and the pVL series (Lucklow and Summers 
(1989) Virology 170:31-39). 

In yet another embodiment, a nucleic acid of the 

2 0 invention is expressed in mammalian cells using a 
mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed (1987) Nature 
329:840), pCI (Promega) , and pMT2PC (Kaufman et al . 
(1987) EMBO J. 6:187-195). When used in mammalian cells, 

25 the expression vector's control functions are often 
provided by viral regulatory elements. For example, 
commonly used promoters are derived from polyoma, 
Adenovirus 2, cytomegalovirus and Simian Virus 40. For 
other suitable expression systems for both prokaryotic 

30 and eukaryotic cells see chapters 16 and 17 of Sambrook 
et al . (supra). For example, Example 9, Example 10, and 
Example 12 describe the expression of human CARD-4 or 
fragments therof, CARD-3, or both from the mammalian 
expression vector pCI . 
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In another embodiment, the recombinant mammalian 
expression vector is capable of directing expression of 
the nucleic acid preferentially in a particular cell type 
(e.g., tissue-specific regulatory elements are used to 
5 express the nucleic acid) . Tissue-specific regulatory 
elements are known in the art. Non-limiting examples of 
suitable tissue -specif ic promoters include the albumin 
promoter ( liver- specif ic; Pinkert et al . (1987) Genes 
Dev. 1:268-277), lymphoid-specif ic promoters (Calame and 
10 Eaton (1988) Adv. Immunol. 43:235-275), in particular 
promoters of T cell receptors (Winoto and Baltimore 

(1989) EMBO J. 8:729-733) and immunoglobulins (Banerji et 
al. (1983) Cell 33:729-740; Queen and Baltimore (1983) 
Cell 33:741-748), neuron- specific promoters (e.g., the 

15 neurofilament promoter; Byrne and Ruddle (1989) Proc . 
Natl. Acad. Sci . USA 86:5473-5477), pancreas-specific 
promoters (Edlund et al . (1985) Science 230:912-916), and 
mammary gland-specific promoters (e.g., milk whey 
promoter; U.S. Patent No. 4,873,316 and European 

20 Application Publication No. 264,166). 

Developmentally- regulated promoters are also encompassed, 
for example the murine hox promoters (Kessel and Gruss 

(1990) Science 249:374-379) and the a-f etoprotein 
promoter (Campes and Tilghman (1989) Genes Dev. 

25 3:537-546) . 

The invention further provides a recombinant 
expression vector comprising a DNA molecule of the 
invention cloned into the expression vector in an 
antisense orientation. That is, the DNA molecule is 

3 0 operatively linked to a regulatory sequence in a manner 
which allows for expression (by transcription of the DNA 
molecule) of an RNA molecule which is antisense to CARD-3 
or CARD-4 mRNA. Regulatory sequences operatively linked 
to a nucleic acid cloned in the antisense orientation can 

35 be chosen which direct the continuous expression of the 
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antisense RNA molecule in a variety of cell types, for 
instance viral promoters and/or enhancers, or regulatory 
sequences can be chosen which direct constitutive, tissue 
specific or cell type specific expression of antisense 
5 RNA. The antisense expression vector can be in the form 
of a recombinant plasmid, phagemid or attenuated virus in 
which antisense nucleic acids are produced under the 
control of a high efficiency regulatory region, the 
activity of which can be determined by the cell type into 
10 which the vector is introduced. For a discussion of the 
regulation of gene expression using antisense genes see 
Weintraub et al . (Reviews - Trends in Genetics, Vol. 1(1) 
1986) . 

Another aspect of the invention pertains to host 
15 cells into which a recombinant expression vector of the 
invention or isolated nucleic acid molecule of the 
invention has been introduced. The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. 
It is understood that such terms refer not only to the 
20 particular subject cell but to the progeny or potential 
progeny of such a cell. Because certain modifications 
may occur in succeeding generations due to either 
mutation or environmental influences, such progeny may 
not, in fact, be identical to the parent cell, but are 

2 5 still included within the scope of the term as used 

herein. 

A host cell can be any prokaryotic or eukaryotic 
cell. For example, CARD-3 or CARD-4 protein can be 
expressed in bacterial cells such as E. coli, insect 

3 0 cells, yeast or mammalian cells (such as Chinese hamster 

ovary cells (CHO) or COS cells) . Other suitable host 
cells are known to those skilled in the art. For 
example, in Example 7 a Saccharomyces cerevisiae host 
cell for recombinant CARD-4 and CARD-3 expression is 
3 5 described and in Examples 9, 10, and 12 2 93T host cells 



WO 99/40102 



PCT/US99/02544 



- 66 - 

for expression of CARD4 or fragments thereof or CARD- 3 
are described. 

Vector DNA or an isolated nucleic acid molecule of 
the invention can be introduced into prokaryotic or 
5 eukaryotic cells via conventional transformation or 
transfection techniques. As used herein, the terms 
11 transformation" and "transfection" are intended to refer 
to a variety of art -recognized techniques for introducing 
foreign nucleic acid (e.g., DNA) into a host cell, 

10 including calcium phosphate or calcium chloride 

co-precipitation, DEAE-dext ran- mediated transfection, 
lipofection, or electroporation . Suitable methods for 
transforming or transfecting host cells can be found in 
Sambrook, et al . (supra), and other laboratory manuals. 

15 For stable transfection of mammalian cells, it is 

known that, depending upon the expression vector and 
transfection technique used, only a small fraction of 
cells may integrate the foreign DNA into their genome. 
IN some cases vector DNA is retained by the host cell. 

2 0 In other cases the host cell does not retain vector DNA 
and retains only an isolated nucleic acid molecule of the 
invention carried by the vector. In some cases, and 
isolated nucleic acid molecule of the invention is used 
to transform a cell without the use of a vector. 

25 In order to identify and select these integrants, 

a gene that encodes a selectable marker (e.g., resistance 
to antibiotics) is generally introduced into the host 
cells along with the gene of interest. Preferred 
selectable markers include those which confer resistance 

30 to drugs, such as G418, hygromycin and methotrexate. 
Nucleic acid encoding a selectable marker can be 
introduced into a host cell on the same vector as that 
encoding CARD- 3 or CARD-4 or can be introduced on a 
separate vector. Cells stably transfected with the 

35 introduced nucleic acid can be identified by drug 
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selection (e.g., cells that have incorporated the 
selectable marker gene will survive, while the other 
cells die) . 

A host cell of the invention, such as a 
5 prokaryotic or eukaryotic host cell in culture, can be 
used to produce a (i.e., express) CARD-3 or CARD-4 
protein. Accordingly, the invention further provides 
methods for producing CARD-3 or CARD-4 protein using the 
host cells of the invention. In one embodiment, the 

10 method comprises culturing the host cell of the invention 
(into which a recombinant expression vector or isolated 
nucleic acid molecule encoding CARD-3 or CARD-4 has been 
introduced) in a suitable medium such that CARD-3 or 
CARD-4 protein is produced. In another embodiment, the 

15 method further comprises isolating CARD-3 or CARD-4 from 
the medium or the host cell. 

The host cells of the invention can also be used 
to produce nonhuman transgenic animals. For example, in 
one embodiment, a host cell of the invention is a 

20 fertilized oocyte or an embryonic stem cell into which 
CARD-3 or CARD-4 -coding sequences have been introduced. 
Such host cells can then be used to create non-human 
transgenic animals in which exogenous CARD-3 or CARD-4 
sequences have been introduced into their genome or 

2 5 homologous recombinant animals in which endogenous CARD-3 
or CARD-4 sequences have been altered. Such animals are 
useful for studying the function and/or activity of 
CARD-3 or CARD-4 and for identifying and/or evaluating 
modulators of CARD-3 or CARD-4 activity. As used herein, 

30 a "transgenic animal" is a non-human animal, preferably a 
mammal, more preferably a rodent such as a rat or mouse, 
in which one or more of the cells of the animal includes 
a transgene. Other examples of transgenic animals 
include non-human primates, sheep, dogs, cows, goats, 

35 chickens, amphibians, etc. A transgene is exogenous DNA 
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which is integrated into the genome of a cell from which 
a transgenic animal develops and which remains in the 
genome of the mature animal, thereby directing the 
expression of an encoded gene product in one or more cell 
5 types or tissues of the transgenic animal . As used 

herein, an "homologous recombinant animal" is a non-human 
animal, preferably a mammal, more preferably a mouse, in 
which an endogenous CARD- 3 or CARD-4 gene has been 
altered by homologous recombination between the 

10 endogenous gene and an exogenous DNA molecule introduced 
into a cell of the animal, e.g., an embryonic cell of the 
animal , prior to development of the animal . 

A transgenic animal of the invention can be 
created by introducing CARD-3 or CARD-4 -encoding nucleic 

15 acid into the male pronuclei of a fertilized oocyte, 
e.g., by microinjection, retroviral infection, and 
allowing the oocyte to develop in a pseudopregnant female 
foster animal. The CARD-3 or CARD-4 cDNA sequence e.g., 
that of (SEQ ID NO : 1 , SEQ ID NO : 3 , SEQ ID NO : 7 , SEQ ID 

20 NO: 9, SEQ ID N0:25, SEQ ID NO:27, SEQ ID NO:38, SEQ ID 

NO: 40, SEQ ID NO: 42 or the cDNA of ATCC , or the 

cDNA of ATCC , or the cDNA of ATCC ) can be 

introduced as a transgene into the genome of a non-human 
animal. Alternatively, a nonhuman homolog or ortholog of 

25 the human CARD-3 or CARD-4 gene, such as a mouse CARD-3 
or CARD-4 gene, can be isolated based on hybridization to 
the human CARD-3 or CARD-4 cDNA and used as a transgene. 
For example, the mouse ortholog of CARD-4, Figure 15 and 
SEQ ID NO: 42 can be used to make a transgenic animal 

30 using standard methods. Intronic sequences and 

polyadenylation signals can also be included in the 
transgene to increase the efficiency of expression of the 
transgene. A tissue-specific regulatory sequence(s) can 
be operably linked to the CARD-3 or CARD-4 transgene to 

35 direct expression of CARD-3 or CARD-4 protein to 
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particular cells. Methods for generating transgenic 
animals via embryo manipulation and microinjection, 
particularly animals such as mice, have become 
conventional in the art and are described, for example, 
5 in U.S. Patent Nos . 4,736,866 and 4,870,009, U.S. Patent 
No. 4,873,191 and in Hogan, Manipulating the Mouse 
Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, N.Y., 1986) . Similar methods are used for 
production of other transgenic animals. A transgenic 

10 founder animal can be identified based upon the presence 
of the CARD-3 or CARD-4 transgene in its genome and/or 
expression of CARD-3 or CARD-4 mRNA in tissues or cells 
of the animals. A transgenic founder animal can then be 
used to breed additional animals carrying the transgene. 

15 Moreover, transgenic animals carrying a transgene 

encoding CARD-3 or CARD-4 can further be bred to other 
transgenic animals carrying other transgenes . 

To create an homologous recombinant animal, a 
vector is prepared which contains at least a portion of a 

20 CARD-3 or CARD-4 gene (e.g., a human or a non-human 
homolog of the CARD-3 or CARD-4 gene, e.g., a murine 
CARD-3 or CARD-4 gene) into which a deletion, addition or 
substitution has been introduced to thereby alter, e.g., 
functionally disrupt, the CARD-3 or CARD-4 gene. In an 

25 embodiment, the vector is designed such that, upon 

homologous recombination, the endogenous CARD-3 or CARD-4 
gene is functionally disrupted (i.e., no longer encodes a 
functional protein; also referred to as a "knock out" 
vector) . Alternatively, the vector can be designed such 

3 0 that, upon homologous recombination, the endogenous 

CARD-3 or CARD-4 gene is mutated or otherwise altered but 
still encodes functional protein (e.g., the upstream 
regulatory region can be altered to thereby alter the 
expression of the endogenous CARD-3 or CARD-4 protein) . 

35 In the homologous recombination vector, the altered 
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portion of the CARD -3 or CARD -4 gene is flanked at its 5' 
and 3' ends by additional nucleic acid of the CARD-3 or 
CARD-4 gene to allow for homologous recombination to 
occur between the exogenous CARD-3 or CARD-4 gene carried 
5 by the vector and an endogenous CARD-3 or CARD-4 gene in 
an embryonic stem cell. The additional flanking CARD-3 
or CARD-4 nucleic acid is of sufficient length for 
successful homologous recombination with the endogenous 
gene. Typically, several kilobases of flanking DNA (both 

10 at the 5' and 3' ends) are included in the vector (see, 
e.g., Thomas and Capecchi (1987) Cell 51:503 for a 
description of homologous recombination vectors) . The 
vector is introduced into an embryonic stem cell line 
(e.g., by electroporation) and cells in which the 

15 introduced CARD-3 or CARD-4 gene has homologously 

recombined with the endogenous CARD-3 or CARD-4 gene are 
selected (see, e.g., Li et al . (1992) Cell 69:915). The 
selected cells are then injected into a blastocyst of an 
animal (e.g., a mouse) to form aggregation chimeras (see, 

20 e.g., Bradley in Teratocarcinomas and .Embryonic Stem 

Cells: A Practical Approach, Robertson, ed. (IRL, Oxford, 
1987) pp. 113-152) . A chimeric embryo can then be 
implanted into a suitable pseudopregnant female foster 
animal and the embryo brought to term. Progeny harboring 

25 the homologously recombined DNA in their germ cells can 
be used to breed animals in which all cells of the animal 
contain the homologously recombined DNA by germline 
transmission of the transgene. Methods for constructing 
homologous recombination vectors and homologous 

30 recombinant animals are described further in Bradley 

(1991) Current Opinion in Bio/Technology 2:823-829 and in 
PCT Publication Nos . WO 90/11354, WO 91/01140, WO 
92/0968, and WO 93/04169. 

In another embodiment, transgenic non-humans 

35 animals can be produced which contain selected systems 
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which allow for regulated expression of the transgene. 
One example of such a system is the cre/loxP recombinase 
system of bacteriophage PI. For a description of the 
cre/loxP recombinase system, see, e.g., Lakso et al . 
5 (1992) Proc. Natl. Acad. Sci . USA 89:6232-623 6. Another 
example of a recombinase system is the FLP recombinase 
system of Saccharomyces cerevisiae (O' Gorman et al . 
(1991) Science 251:1351-1355. If a cre/loxP recombinase 
system is used to regulate expression of the transgene, 

10 animals containing transgenes encoding both the Cre 

recombinase and a selected protein are required. Such 
animals can be provided through the construction of 
"double" transgenic animals, e.g., by mating two 
transgenic animals, one containing a transgene encoding a 

15 selected protein and the other containing a transgene 
encoding a recombinase. 

Clones of the non-human transgenic animals 
described herein can also be produced according to the 
methods described in Wilmut et al . (1997) Nature 

20 385:810-813 and PCT Publication Nos . WO 97/07668 and WO 
97/07669. In brief, a cell, e.g., a somatic cell, from 
the transgenic animal can be isolated and induced to exit 
the growth cycle and enter Go phase. The quiescent cell 
can then be fused, e.g., through the use of electrical 

25 pulses, to an enucleated oocyte from an animal of the 
same species from which the quiescent cell is isolated. 
The reconstructed oocyte is then cultured such that it 
develops to morula or blastocyte and then transferred to 
pseudopregnant female foster animal. The offspring borne 

3 0 of this female foster animal will be a clone of the 

animal from which the cell, e.g., the somatic cell, is 
isolated . 
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IV. Pharmaceutical Compositions 

The CARD-3 or CARD-4 nucleic acid molecules, 
CARD- 3 or CARD-4 proteins, and anti-CARD-3 or CARD-4 
antibodies (also referred to herein as "active 
5 compounds") of the invention can be incorporated into 
pharmaceutical compositions suitable for administration. 
Such compositions typically comprise the nucleic acid 
molecule, protein, or antibody and a pharmaceutically 
acceptable carrier. As used herein the language 

10 "pharmaceutically acceptable carrier" is intended to 

include any and all solvents, dispersion media, coatings, 
antibacterial and antifungal agents, isotonic and 
absorption delaying agents, and the like, compatible with 
pharmaceutical administration. The use of such media and 

15 agents for pharmaceutically active substances is well 
known in the art. Except insofar as any conventional 
media or agent is incompatible with the active compound, 
use thereof in the compositions is contemplated. 
Supplementary active compounds can also be incorporated 

20 into the compositions. 

A pharmaceutical composition of the invention is 
formulated to be compatible with its intended route of 
administration. Examples of routes of administration 
include parenteral, e.g., intravenous, intradermal, 

25 subcutaneous, oral (e.g., inhalation), transdermal 
(topical), transmucosal , and rectal administration. 
Solutions or suspensions used for parenteral, 
intradermal , or subcutaneous application can include the 
following components: a sterile diluent such as water for 

30 injection, saline solution, fixed oils, polyethylene 

glycols, glycerine, propylene glycol or other synthetic 
solvents; antibacterial agents such as benzyl alcohol or 
methyl parabens; antioxidants such as ascorbic acid or 
sodium bisulfite; chelating agents such as 

35 ethylenediaminetetraacetic acid; buffers such as 
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acetates, citrates or phosphates and agents for the 
adjustment of tonicity such as sodium chloride or 
dextrose. pH can be adjusted with acids or bases, such 
as hydrochloric acid or sodium hydroxide. The parenteral 
5 preparation can be enclosed in ampoules, disposable 

syringes or multiple dose vials made of glass or plastic. 

Pharmaceutical compositions suitable for 
injectable use include sterile aqueous solutions (where 
water soluble) or dispersions and sterile powders for the 

10 extemporaneous preparation of sterile injectable 

solutions or dispersion. For intravenous administration, 
suitable carriers include physiological saline, 
bacteriostatic water, Cremophor EL? (BASF; Parsippany, 
NJ) or phosphate buffered saline (PBS) . In all cases, 

15 the composition must be sterile and should be fluid to 
the extent that easy syringability exists. It must be 
stable under the conditions of manufacture and storage 
and must be preserved against the contaminating action of 
microorganisms such as bacteria and fungi. The carrier 

2 0 can be a solvent or dispersion medium containing, for 
example, water, ethanol, polyol (for example, glycerol, 
propylene glycol, and liquid polyetheylene glycol, and 
the like), and suitable mixtures thereof. The proper 
fluidity can be maintained, for example, by the use of a 

25 coating such as lecithin, by the maintenance of the 

required particle size in the case of dispersion and by 
the use of surfactants. Prevention of the action of 
microorganisms can be achieved by various antibacterial 
and antifungal agents, for example, parabens, 

30 chlorobutanol , phenol, ascorbic acid, thimerosal , and the 
like. In many cases, it will be preferable to include 
isotonic agents, for example, sugars, polyalcohols such 
as mannitol, sorbitol, sodium chloride in the 
composition. Prolonged absorption of the injectable 
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compositions can be brought about by including in the 
composition an agent which delays absorption, for 
example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by 
5 incorporating the active compound (e.g., a CARD-3 or 

CARD-4 protein or ant i -CARD-3 or CARD-4 antibody) in the 
required amount in an appropriate solvent with one or a 
combination of ingredients enumerated above, as required, 
followed by filtered sterilization. Generally, 

10 dispersions are prepared by incorporating the active 
compound into a sterile vehicle which contains a basic 
dispersion medium and the required other ingredients from 
those enumerated above. In the case of sterile powders 
for the preparation of sterile injectable solutions, the 

15 preferred methods of preparation are vacuum drying and 
freeze -drying which yields a powder of the active 
ingredient plus any additional desired ingredient from a 
previously sterile- filtered solution thereof. 

Oral compositions generally include an inert 

2 0 diluent or an edible carrier. They can be enclosed in 
gelatin capsules or compressed into tablets. For the 
purpose of oral therapeutic administration, the active 
compound can be incorporated with excipients and used in 
the form of tablets, troches, or capsules. Oral 

25 compositions can also be prepared using a fluid carrier 
for use as a mouthwash, wherein the compound in the fluid 
carrier is applied orally and swished and expectorated or 
swallowed. Pharmaceutically compatible binding agents, 
and/or adjuvant materials can be included as part of the 

30 composition. The tablets, pills, capsules, troches and 
the like can contain any of the following ingredients, or 
compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an 
excipient such as starch or lactose, a disintegrating 

35 agent such as alginic acid, Primogel, or corn starch; a 
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lubricant such as magnesium stearate or Sterotes; a 
glidant such as colloidal silicon dioxide; a sweetening 
agent such as sucrose or saccharin; or a flavoring agent 
such as peppermint, methyl salicylate, or orange 
5 flavoring. For administration by inhalation, the 

compounds are delivered in the form of an aerosol spray 
from pressured container or dispenser which contains a 
suitable propellant, e.g., a gas such as carbon dioxide, 
or a nebulizer. 

10 Systemic administration can also be by 

transmucosal or transdermal means. For transmucosal or 
transdermal administration, penetrants appropriate to the 
barrier to be permeated are used in the formulation. 
Such penetrants are generally known in the art, and 

15 include, for example, for transmucosal administration, 
detergents, bile salts, and fusidic acid derivatives. 
Transmucosal administration can be accomplished through 
the use of nasal sprays or suppositories. For 
transdermal administration, the active compounds are 

20 formulated into ointments, salves, gels, or creams as 
generally known in the art. 

The compounds can also be prepared in the form of 
suppositories (e.g., with conventional suppository bases 
such as cocoa butter and other glycerides) or retention 

2 5 enemas for rectal delivery. 

In one embodiment, the active compounds are 
prepared with carriers that will protect the compound 
against rapid elimination from the body, such as a 
controlled release formulation, including implants and 

3 0 microencapsulated delivery systems. Biodegradable, 

biocompatible polymers can be used, such as ethylene 
vinyl acetate, polyanhydrides , polyglycolic acid, 
collagen, polyorthoesters , and polylactic acid. Methods 
for preparation of such formulations will be apparent to 
35 those skilled in the art. The materials can also be 
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obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including 
liposomes targeted to infected cells with monoclonal 
antibodies to viral antigens) can also be used as 
5 pharmaceutical^ acceptable carriers. These can be 

prepared according to methods known to those skilled in 
the art, for example, as described in U.S. Patent No. 
4,522,811. 

It is especially advantageous to formulate oral or 

10 parenteral compositions in dosage unit form for ease of 
administration and uniformity of dosage. Dosage unit 
form as used herein refers to physically discrete units 
suited as unitary dosages for the subject to be treated; 
each unit containing a predetermined quantity of active 

15 compound calculated to produce the desired therapeutic 
effect in association with the required pharmaceutical 
carrier. The specification for the dosage unit forms of 
the invention are dictated by and directly dependent on 
the unique characteristics of the active compound and the 

20 particular therapeutic effect to be achieved, and the 
limitations inherent in the art of compounding such an 
active compound for the treatment of individuals. 

The nucleic acid molecules of the invention can be 
inserted into vectors and used as gene therapy vectors. 

25 Gene therapy vectors can be delivered to a subject by, 

for example, intravenous injection, local administration 
(U.S. Patent 5,328,470) or by stereotactic injection 
(see, e.g., Chen et al . (1994) Proc. Natl. Acad. Sci . USA 
91:3054-3057). The pharmaceutical preparation of the 

3 0 gene therapy vector can include the gene therapy vector 
in an acceptable diluent, or can comprise a slow release 
matrix in which the gene delivery vehicle is imbedded. 
Alternatively, where the complete gene delivery vector 
can be produced intact from recombinant cells, e.g. 

35 retroviral vectors, the pharmaceutical preparation can 
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include one or more cells which produce the gene delivery- 
system. 

The pharmaceutical compositions can be included in 
a container, pack, or dispenser together with 
instructions for administration. 

V, Uses and Methods of the Invention 

The nucleic acid molecules, proteins, protein 
homologues, and antibodies described herein can be used 
in one or more of the following methods: a) screening 
assays; b) detection assays (e.g., chromosomal mapping, 
tissue typing, forensic biology) , c) predictive medicine 
(e.g., diagnostic assays, prognostic assays, monitoring 
clinical trials, and pharmacogenomics) ; and d) methods of 
treatment (e.g., therapeutic and prophylactic). A CARD-3 
or CARD-4 protein interacts with other cellular proteins 
and can thus be used for (i) regulation of cellular 
proliferation; (ii) regulation of cellular 
differentiation; and (iii) regulation of cell survival. 
The isolated nucleic acid molecules of the invention can 
be used to express CARD-3 or CARD-4 protein (e.g., via a 
recombinant expression vector in a host cell in gene 
therapy applications) , to detect CARD-3 or CARD-4 mRNA 
(e.g w in a biological sample) or a genetic lesion in a 
CARD-3 or CARD-4 gene, and to modulate CARD-3 or CARD-4 
activity. In addition, the CARD-3 or CARD-4 proteins can 
be used to screen drugs or compounds which modulate the 
CARD-3 or CARD-4 activity or expression as well as to 
treat disorders characterized by insufficient or 
excessive production of CARD-3 or CARD-4 protein or 
production of CARD-3 or CARD-4 protein forms which have 
decreased or aberrant activity compared to CARD-3 or 
CARD-4 wild type protein. In addition, the anti-CARD-3 
or CARD-4 antibodies of the invention can be used to 
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detect and isolate CARD-3 or CARD-4 proteins and modulate 
CARD -3 or CARD-4 activity. 

This invention further pertains to novel agents 
identified by the above -described screening assays and 
5 uses thereof for treatments as described herein. 

A. Screening Assays 

The invention provides a method (also referred to 
herein as a "screening assay") for identifying 
modulators, i.e., candidate or test compounds or agents 

10 (e.g., peptides, peptidomimetics , small molecules or 

other drugs) which bind to CARD-3 or CARD-4 proteins or 
biologically active portions thereof or have a 
stimulatory or inhibitory effect on, for example, CARD-3 
or CARD-4 expression or CARD-3 or CARD-4 activity. An 

15 example of a biologically active portion of human CARD-4 
is amino acids 1-145 encoding the CARD domain which is 
sufficient to exhibit CARD-3 -binding activity as 
described in Example 7. Amino acids 406-953 of human 
CARD4L comprising the L.RR domain represent a biologically 

2 0 active port ion of CARD-4L because they possess 

hNUDC-binding activity as described in Example 8. 

In one embodiment, the invention provides assays 
for screening candidate or test compounds which bind to 
or modulate the activity of a CARD-3 or CARD-4 proteins 

25 or polypeptides or biologically active portions thereof. 
The test compounds of the present invention can be 
obtained using any of the numerous approaches in 
combinatorial library methods known in the art, 
including: biological libraries; spatially addressable 

30 parallel solid phase or solution phase libraries; 

synthetic library methods requiring deconvolution; the 
"one-bead one -compound" library method; and synthetic 
library methods using affinity chromatography selection. 
The biological library approach is limited to peptide 



WO 99/40102 



PCT/US99/02544 



- 79 - 

libraries, while the other four approaches are applicable 
to peptide, non-peptide oligomer or small molecule 
libraries of compounds (Lam (1997) Anticancer Drug Des. 
12:145) . Examples of methods for the synthesis of 
5 molecular libraries can be found in the art, for example 
in: DeWitt et al . (1993) Proc . Natl. Acad. Sci . U.S.A. 
90:6909; Erb et al . (1994) Proc. Natl. Acad. Sci. USA 
91:11422; Zuckermann et al . (1994). J. Med. Chem. 
37:2678; Cho et al . (1993) Science 261:1303; Carrell et 

10 al . (1994) Angew. Chem. Int. Ed. Engl. 33:2059; Carell et 
al . (1994) Angew. Chem. Int. Ed. Engl. 33:2061; and 
Gallop et al. (1994) J. Med. Chem. 37:1233. 

Libraries of compounds may be presented in 
solution (e.g., Houghten (1992) Bio/Techniques 

15 13:412-421), or on beads (Lam (1991) Nature 354:82-84), 
chips (Fodor (1993) Nature 364:555-556), bacteria (U.S. 
Patent No. 5,223,409), spores (Patent Nos . 5,571,698; 
5,403,484; and 5,223,409), plasmids (Cull et al . (1992) 
Proc. Natl. Acad. Sci. USA 89:1865-1869) or on phage 

20 (Scott and Smith (1990) Science 249:386-390; Devlin 

(1990) Science 249:404-406; Cwirla et al . (1990) Proc. 
Natl. Acad. Sci. 87:6378-6382; and Felici (1991) J. Mol . 
Biol . 222 :301-310) . 

Determining the ability of the test compound to 

25 modulate the activity of CARD-3 or CARD-4 or a 

biologically active portion thereof can be accomplished, 
for example, by determining the ability of the CARD-3 or 
CARD-4 protein to bind to or interact with a CARD-3 or 
CARD-4 target molecule. As used herein, a "target 

30 molecule" is a molecule with which a CARD-3 or CARD-4 
protein binds or interacts in nature, for example, a 
molecule associated with the internal surface of a cell 
membrane or a cytoplasmic molecule. A CARD-3 or CARD-4 
target molecule can be a non-CARD-3 or CARD-4 molecule or 

35 a CARD-3 or CARD-4 protein or polypeptide of the present 
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invention. In one embodiment, a CARD -3 or CARD -4 target 
molecule is a component of an apoptotic signal 
transduction pathway, e.g., CARD-3 and CARD- 4 . The 
target, for example, can be a second intracellular 
5 protein which has catalytic activity or a protein which 
facilitates the association of downstream signaling 
molecules with CARD-3 or CARD-4. In another embodiment, 
CARD-3 or CARD-4 target molecules include CARD-3 because 
CARD-3 was found to bind to CARD-4 (Examples 7 and 12) 
10 and hNUDC because hNUDC was found to bind to CARD-4 
(Example 8) . 

Determining the ability of the test compound to 
modulate the activity of CARD-3 or CARD-4 or a 
biologically active portion thereof can be accomplished, 

15 for example, by determining the ability of the CARD-3 or 
CARD-4 protein to bind to or interact with any of the 
specific proteins listed in the previous paragraph as 
CARD-3 or CARD-4 target molecules. In another 
embodiment, CARD-3 or CARD-4 target molecules include all 

20 proteins that bind to a CARD-3 or CARD-4 protein or 
fragment thereof in a two-hybrid system binding assay 
which can be used without undue experimentation to 
isolate such proteins from cDNA or genomic two-hybrid 
system libraries. For example, Example 7 describes the 

25 use of the CARD-4 CARD domain region to identify CARD-3 
in a two-hybrid screen and Example 8 describes the use of 
the CARD-4 LRR region to identify hNUDC in a two-hybrid 
screen. The binding assays described in this section 
could be cell -based or cell free (described 

30 subsequently) . 

Determining the ability of the CARD-3 or CARD-4 
protein to bind to or interact with a CARD-3 or CARD-4 
target molecule can be accomplished by one of the methods 
described above for determining direct binding. In an 

35 embodiment, determining the ability of the CARD-3 or 
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CARD-4 protein to bind to or interact with a CARD-3 or 
CARD-4 target molecule can be accomplished by determining 
the activity of the target molecule. For example, the 
activity of the target molecule can be determined by 
5 detecting induction of a cellular second messenger of the 
target (e.g., intracellular Ca2+, diacylglycerol , IP3, 
etc.), detecting catalytic/enzymatic activity of the 
target on an appropriate substrate, detecting the 
induction of a reporter gene (e.g., a CARD-3 or 

10 CARD-4 -responsive regulatory element operatively linked 
to a nucleic acid encoding a detectable marker, e.g. 
lucif erase) , or detecting a cellular response, for 
example, cell survival, cellular differentiation, or cell 
proliferation. For example, in Example 12 CARD-4 is 

15 shown to bind to CARD-3 and in Example 10, by monitoring 
a cellular response, CARD-4 is shown to enhance caspase 9 
activity, cell death or apoptosis. Because CARD-3 and 
CARD-4 enhance caspase 9 activity, CARD-3 or CARD-4 
activity can be monitored by assaying the caspase 

20 9-mediated apoptosis cellular response or caspase 9 
enzymatic activity. In addition, and in another 
embodiment, genes induced by CARD-3 or CARD-4 expression 
could be identified by expressing CARD-3 or CARD-4 in a 
cell line and conducting a transcriptional profiling 

2 5 experiment wherein the mRNA expression patterns of the 

cell line transformed with an empty expression vector and 
the cell line transformed with a CARD-3 or CARD-4 
expression vector are compared. The promoters of genes 
induced by CARD-3 or CARD-4 expression could be 

3 0 operatively linked to reporter genes suitable for 

screening such as lucif erase, secreted alkaline 
phosphatase, or beta-galactosidase and the resulting 
constructs could be introduced into appropriate 
expression vectors. A recombinant cell line containing 
35 CARD-3 or CARD-4 and transfected with an expression 
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vector containing a CARD - 3 or CARD- 4 responsive promoter 
operatively linked to a reporter gene could be used to 
identify test compounds that modulate CARD-3 or CARD-4 
activity by assaying the expression of the reporter gene 
5 in response to contacting the recombinant cell line with 
test compounds. CARD-3 or CARD-4 agonists can be 
identified as increasing the expression of the reporter 
gene and CARD-3 or CARD-4 antagonists can be identified 
as decreasing the expression of the reporter gene. 

10 In another embodiment of the invention, the 

ability of a test compound to modulate the activity of 
CARD-3, CARD-4, or biologically active portions thereof 
can be determined by assaying the ability of the test 
compound to modulate CARD-3 or CARD-4 -dependent pathways 

15 or processes where the CARD-3 or CARD-4 target proteins 
that mediate the CARD-3 or CARD-4 effect are known or 
unknown. Potential CARD-3 or CARD-4 -dependent pathways 
or processes include but are not limited to the 
modulation of cellular signal transduction pathways and 

20 their related second messenger molecules (e.g., 

intracellular Ca2 + , diacylglycerol , IP3, cAMP etc.), 
cellular enzymatic activities, cellular responses (e . g . , 
cell survival, cellular differentiation, or cell 
proliferation) , or the induction or repression of 

25 cellular or heterologous mRNAs or proteins. CARD-3 or 
CARD-4 -dependent pathways or processes could be assayed 
by standard cell -based or cell free assays appropriate 
for the specific pathway or process under study. For 
example, Example 9 describes how expression of CARD- 4 S or 

30 CARD- 4 L in 293T cells induces the NF-kB pathway as 

determined by the measurement of a cotransf ected NF-kB 
pathway lucif erase reporter gene. In another embodiment, 
cells cotransf ected with CARD-4 and the NF-kB lucif erase 
reporter gene could be contacted with a test compound and 

35 test compounds that block CARD-4 activity could be 
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identified by their reduction of CARD- 4 -dependent NF-kB 
pathway lucif erase reporter gene expression. Test 
compounds that agonize CARD-4 would be expected to 
increase reporter gene expression. In another 
5 embodiment, CARD-4 could be expressed in a cell line and 
the recombinant CARD-4 -expressing cell line could be 
contacted with a test compound. Test compounds that 
inhibit CARD-4 activity could be indentified by their 
reduction of CARD- 4 -depended NF-kB pathway stimulation as 

10 measured by the assay of a NF-kB pathway reporter gene, 
NF-kB nuclear localization, IkB phosphorylation or 
proteolysis, or other standard assays for NF-kB pathway 
activation known to those skilled in the art. 

In yet another embodiment, an assay of the present 

15 invention is a cell -free assay comprising contacting a 
CARD- 3 or CARD-4 protein or biologically active portion 
thereof with a test compound and determining the ability 
of the test compound to bind to the CARD- 3 or CARD-4 
protein or biologically active portion thereof. Binding 

20 of the test compound to the CARD-3 or CARD-4 protein can 
be determined either directly or indirectly as described 
above. In one embodiment/ a competitive binding assay 
includes contacting the CARD-3 or CARD-4 protein or 
biologically active portion thereof with a compound known 

2 5 to bind CARD-3 or CARD-4 to form an assay mixture, 

contacting the assay mixture with a test compound, and 
determining the ability of the test compound to interact 
with a CARD-3 or CARD-4 protein, wherein determining the 
ability of the test compound to interact with a CARD-3 or 

3 0 CARD-4 protein comprises determining the ability of the 

test compound to preferentially bind to CARD-3 or CARD-4 
or biologically active portion thereof as compared to the 
known binding compound. 

In another embodiment, an assay is a cell -free 
35 assay comprising contacting CARD-3 or CARD-4 protein or 
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biologically active portion thereof with a test compound 
and determining the ability of the test compound to 
modulate (e.g., stimulate or inhibit) the activity of the 
CARD-3 or CARD-4 protein or biologically active portion 
5 thereof. Determining the ability of the test compound to 
modulate the activity of CARD-3 or CARD-4 can be 
accomplished, for example, by determining the ability of 
the CARD-3 or CARD-4 protein to bind to a CARD-3 or 
CARD-4 target molecule by one of the methods described 

10 above for determining direct binding. In an alternative 
embodiment, determining the ability of the test compound 
to modulate the activity of CARD-3 or CARD-4 can be 
accomplished by determining the ability of the CARD-3 or 
CARD-4 protein to further modulate a CARD-3 or CARD-4 

15 target molecule. For example, the catalytic/enzymatic 
activity of the target molecule on an appropriate 
substrate can be determined as previously described. 

In yet another embodiment, the cell -free assay 
comprises contacting the CARD-3 or CARD-4 protein or 

2 0 biologically active portion thereof with a known compound 

which binds CARD-3 or CARD-4 to form an assay mixture, 
contacting the assay mixture with a test compound, and 
determining the ability of the test compound to interact 
with a CARD-3 or CARD-4 protein, wherein determining the 
25 ability of the test compound to interact with a CARD-3 or 
CARD-4 protein comprises determining the ability of the 
CARD-3 or CARD-4 protein to preferentially bind to or 
modulate the activity of a CARD-3 or CARD-4 target 
molecule. The cell -free assays of the present invention 

3 0 are amenable to use of both the soluble form or the 

membrane-associated form of CARD-3 or CARD-4. A 
membrane-associated form of CARD-3 or CARD-4 refers to 
CARD-3 or CARD-4 that interacts with a membrane -bound 
target molecule. In the case of cell-free assays 
35 comprising the membrane-associated form of CARD-3 or 
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CARD-4, it may be desirable to utilize a solubilizing 
agent such that the membrane-associated form of CARD-3 or 
CARD-4 is maintained in solution. Examples of such 
solubilizing agents include non-ionic detergents such as 
5 n-octylglucoside , n-dodecylglucoside , n-dodecylmaltoside , 
octanoyl -N-methylglucamide , decanoyl -N-methylglucamide , 
Triton® X-100, Triton® X-114, Thesit®, 
Isotridecypoly (ethylene glycol ether) n, 
3- [ (3 -cholamidopropyl) dimethylamminio] -1 -propane 
10 sulfonate (CHAPS ) , 

3- [ (3 -cholamidopropyl ) dimethylamminio] -2 -hydroxy- 1 -propan 
e sulfonate (CHAPSO) , or 

N-dodecyl=N, N-dimethyl - 3 -ammonio- 1 -propane sulfonate . 

In more than one embodiment of the above assay 

15 methods of the present invention, it may be desirable to 
immobilize either CARD-3 or CARD-4 or its target molecule 
to facilitate separation of complexed from uncomplexed 
forms of one or both of the proteins, as well as to 
accommodate automation of the assay. Binding of a test 

20 compound to CARD-3 or CARD-4, or interaction of CARD-3 or 
CARD-4 with a target molecule in the presence and absence 
of a candidate compound, can be accomplished in any 
vessel suitable for containing the reactants. Examples 
of such vessels include microtitre plates, test tubes, 

25 and micro-centrifuge tubes. In one embodiment, a fusion 
protein can be provided which adds a domain that allows 
one or both of the proteins to be bound to a matrix. For 
example, glutathione-S-transf erase/ CARD-3 or CARD-4 
fusion proteins or glutathione-S- transf erase/target 

3 0 fusion proteins can be adsorbed onto glutathione 
sepharose beads (Sigma Chemical; St. Louis > MO) or 
glutathione derivatized microtitre plates, which are then 
combined with the test compound or the test compound and 
either the non-adsorbed target protein or CARD-3 or 

3 5 CARD-4 protein, and the mixture incubated under 
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conditions conducive to complex formation (e.g., at 
physiological conditions for salt and pH) . Following 
incubation, the beads or microtitre plate wells are 
washed to remove any unbound components, the matrix 
5 immobilized in the case of beads, complex determined 

either directly or indirectly, for example, as described 
above. Alternatively, the complexes can be dissociated 
from the matrix, and the level of CARD -3 or CARD-4 
binding or activity determined using standard techniques. 

10 In an alternative embodiment, MYC or HA epitope tag 

CARD-3 or CARD-4 fusion proteins or MYC or HA epitope tag 
target fusion proteins can be adsorbed onto ant i -MYC or 
ant i -HA antibody coated microbeads or onto ant i -MYC or 
anti-HA antibody coated microtitre plates, which are then 

15 combined with the test compound or the test compound and 
either the non- adsorbed target protein or CARD-3 or 
CARD-4 protein, and the mixture incubated under 
conditions conducive to complex formation (e.g., at 
physiological conditions for salt and pH) . Following 

20 incubation, the beads or microtitre plate wells are 
washed to remove any unbound components, the matrix 
immobilized in the case of beads, complex determined 
either directly or indirectly, for example, as described 
above. Alternatively, the complexes can be dissociated 

25 from the matrix, and the level of CARD-3 or CARD-4 

binding or activity determined using standard techniques. 
Example 12 describes an HA epitope tagged CARD-4 protein 
that physically interacts in a coimmunoprecipitation 
assay with MYC epitope tagged CARD-3. In an embodiment 

30 of the invention, HA epitope tagged CARD-4 could be used 
in combination with MYC epitope CARD-3 in the sort of 
protein-protein interaction assay described earlier in 
this paragraph. 

Other techniques for immobilizing proteins on 

35 matrices can also be used in the screening assays of the 



WO 99/40102 



PCT/US99/02544 



- 87 - 

invention. For example, either CARD-3 or CARD-4 or its 
target molecule can be immobilized utilizing conjugation 
of biotin and streptavidin . Biotinylated CARD-3 or 
CARD-4 or target molecules can be prepared from 
5 biotin-NHS (N-hydroxy-succinimide) using techniques well 
known in the art (e.g., biotinylat ion kit, Pierce 
Chemicals; Rockford, IL) , and immobilized in the wells of 
streptavidin-coated 96 well plates (Pierce Chemical) . 
Alternatively, antibodies reactive with CARD-3 or CARD-4 

10 or target molecules but which do not interfere with 
binding of the CARD-3 or CARD-4 protein to its target 
molecule can be derivatized to the wells of the plate, 
and unbound target or CARD-3 or CARD-4 trapped in the 
wells by antibody conjugation. Methods for detecting 

15 such complexes, in addition to those described above for 
the GST- immobilized complexes and epitope tag immobilized 
complexes, include immunodetection of complexes using 
antibodies reactive with the CARD-3 or CARD-4 or target 
molecule, as well as enzyme-linked assays which rely on 

2 0 detecting an enzymatic activity associated with the 
CARD-3 or CARD-4 or target molecule. 

In another embodiment, modulators of CARD-3 or 
CARD-4 expression are identified in a method in which a 
cell is contacted with a candidate compound and the 

2 5 expression of the CARD-3 or CARD-4 promoter, mRNA or 

protein in the cell is determined. The level of 
expression of CARD-3 or CARD-4 mRNA or protein in the 
presence of the candidate compound is compared to the 
level of expression of CARD-3 or CARD-4 mRNA or protein 

3 0 in the absence of the candidate compound. The candidate 

compound can then be identified as a modulator of CARD-3 
or CARD-4 expression based on this comparison. For 
example, when expression of CARD-3 or CARD-4 mRNA or 
protein is greater (statistically significantly greater) 
3 5 in the presence of the candidate compound than in its 
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absence, the candidate compound is identified as a 
stimulator of CARD- 3 or CARD-4 mRNA or protein 
expression. Alternatively, when expression of CARD- 3 or 
CARD-4 mRNA or protein is less (statistically 
5 significantly less) in the presence of the candidate 
compound than in its absence, the candidate compound is 
identified as an inhibitor of CARD-3 or CARD-4 mRNA or 
protein expression. The level of CARD-3 or CARD-4 mRNA 
or protein expression in the cells can be determined by 

10 methods described herein for detecting CARD-3 or CARD-4 
mRNA or protein. The activity of the CARD-3 or CARD-4 
promoter can be assayed by linking the CARD-3 or CARD-4 
promoter to a reporter gene such as lucif erase, secreted 
alkaline phosphatase, or beta-galactosidase and 

15 introducing the resulting construct into an appropriate 
vector, transfecting a host cell line, and measuring the 
activity of the reporter gene in response to test 
compounds. For example, two CARD-4 -specif ic mRNAs were 
detected in a Northern blotting experiment, one of 4.6 

2 0 kilobases and the other of 6.5-7.0 kilobases (Example 
11) . In Example 11, CARD-4-specif ic mRNA species were 
found to be widely distributed in the tissues and cell 
lines studied. 

In yet another aspect of the invention, the CARD-3 

25 or CARD-4 proteins can be used as "bait proteins" in a 
two-hybrid assay or three hybrid assay (see, e.g., U.S. 
Patent No. 5,283,317; Zervos et al . (1993) Cell 
72:223-232; Madura et al . (1993) J. Biol. Chem. 
268:12046-12054; Bartel et al . (1993) Bio/Techniques 

30 14:920-924; Iwabuchi et al . (1993) Oncogene 8:1693-1696; 
and PCT Publication No. WO 94/10300), to identify other 
proteins, which bind to or interact with CARD-3 or CARD-4 
("CARD-3 or CARD-4 -binding proteins" or "CARD-3 or 
CARD-4 -bp") and modulate CARD-3 or CARD-4 activity. Such 

35 CARD-3 or CARD-4 -binding proteins are also likely to be 
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involved in the propagation of signals by the CARD - 3 or 
CARD-4 proteins as, for example, upstream or downstream 
elements of the CARD- 3 or CARD-4 pathway. For example, 
Example 7 describes the construction of a two-hybrid 
5 screening bait construct including human CARD - 4 L amino 
acids 1-14 5 comprising the CARD domain and the use of 
this bait construct to screen human mammary gland and 
prostate gland two-hybrid libraries resulting in the 
identification of human CARD-3 as a CARD-4 interacting 

10 protein. In another example, Example 8 describes the 
construction of a two-hybrid screening bait construct 
including human CARD-4 amino acids 406-953 comprising the 
LRR domain and the use of this bait construct to screen a 
human mammary gland two-hybrid libraries resulting in the 

15 identification of hNUDC as a CARD-4 interacting protein. 

The two-hybrid system is based on the modular 
nature of most transcription factors, which consist of 
separable DNA-binding and activation domains. Briefly, 
the assay utilizes two different DNA constructs. In one 

20 construct, the gene that codes for CARD-3 or CARD-4 is 
fused to a gene encoding the DNA binding domain of a 
known transcription factor (e.g., GAL - 4 ) . In the other 
construct, a DNA sequence, from a library of DNA 
sequences, that encodes an unidentified protein ("prey" 

25 or "sample") is fused to a gene that codes for the 

activation domain of the known transcription factor. If 
the "bait" and the "prey" proteins are able to interact, 
in vivo, forming an CARD-3 or CARD-4 -dependent complex, 
the DNA-binding and activation domains of the 

3 0 transcription factor are brought into close proximity. 
This proximity allows transcription of a reporter gene 
(e.g., LacZ) which is operably linked to a 
transcriptional regulatory site responsive to the 
transcription factor. Expression of the reporter gene 

35 can be detected and cell colonies containing the 
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functional transcription factor can be isolated and used 
to obtain the cloned gene which encodes the protein which 
interacts with CARD -3 or CARD - 4 . 

In an embodiment of the invention, the ability of 
5 a test compound to modulate the activity of CARD-3, 

CARD-4, or a biologically active portion thereof can be 
determined by assaying the ability of the test compound 
to block the binding of CARD-3 and CARD-4 to their target 
proteins in a two-hybrid system assay. Example 7 

10 describes a two-hybrid system assay for the interaction 
between CARD- 3 and CARD-4 and Example 8 describes a 
two-hybrid system assay for the interaction between 
CARD-4 and its target protein hNUDC . To screen for test 
compounds that block the interaction between CARD- 3 and 

15 CARD-4 and their target proteins, which include but are 
not limited to CARD-3, CARD-4, and hNUDC, a yeast 
two-hybrid screening strain coexpressing the interacting 
bait and prey constructs, for example, a CARD-4 bait 
construct and a CARD-3 prey construct as described in 

2 0 Example 7, is contacted with the test compound and the 

activity of the two-hybrid system reporter gene, usually 
HIS3 , lacZ, or URA3 is assayed. If the strain remains 
viable but exhibits a significant decrease in reporter 
gene activity, this would indicate that the test compound 

25 has inhibited the interaction between the bait and prey 
proteins. This assay could be automated for high 
throughput drug screening purposes . In another 
embodiment of the invention, CARD-3 or CARD-4 and their 
target proteins could be configured in the reverse 

30 two-hybrid system (Vidal et al . (1996) Proc . Natl. Acad. 
Sci. USA 93:10321-6 and Vidal et al . (1996) Proc. Natl. 
Acad. Sci. USA 93:10315-20) designed specifically for 
efficient drug screening. In the reverse two-hybrid 
system, inhibition of a CARD-3 or CARD-4 physical 

3 5 interaction with a target protein would result in 
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induction of a reporter gene in contrast to the normal 
two-hybrid system where inhibition of CARD- 3 or CARD-4 
physical interaction with a target protein would lead to 
reporter gene repression. The reverse two-hybrid system 
5 is preferred for drug screening because reporter gene 
induction is more easily assayed than reporter gene 
repression . 

Alternative embodiments of the invention are 
proteins found to physically interact with proteins that 

10 bind to CARD-3 or CARD-4. CARD-3 or CARD-4 interactors, 
including but not limited to hNUDC and CARD-3, could be 
configured into two-hybrid system baits and used in 
two-hybrid screens to identify additional members of the 
CARD-3 and CARD-4 pathway. The interactors of CARD-3 or 

15 CARD-4 interactors identified in this way could be useful 
targets for therapeutic intervention in CARD-4 related 
diseases and pathologies and an assay of their enzymatic 
or binding activity could be useful for the 
identification of test compounds that modulate CARD-3 or 

20 CARD-4 activity. 

This invention further pertains to novel agents 
identified by the above-described screening assays and 
uses thereof for treatments as described herein. 



B. Detection Assays 

25 Portions or fragments of the cDNA sequences 

identified herein (and the corresponding complete gene 
sequences) can be used in numerous ways as polynucleotide 
reagents. For example, these sequences can be used to: 
(i) map their respective genes on a chromosome; and, 

30 thus, locate gene regions associated with genetic 
disease; (ii) identify an individual from a minute 
biological sample (tissue typing) ; and (iii) aid in 
forensic identification of a biological sample. These 
applications are described in the subsections below. 
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1 . Chromosome Mapping 

Once the sequence (or a portion of the sequence) 
of a gene has been isolated, this sequence can be used to 
map the location of the gene on a chromosome. 
5 Accordingly, CARD-3 or CARD-4 nucleic acid molecules 

described herein or fragments thereof, can be used to map 
the location of CARD-3 or CARD-4 genes on a chromosome. 
The mapping of the CARD-3 or CARD-4 sequences to 
chromosomes is an important first step in correlating 

10 these sequences with genes associated with disease. 

Briefly, CARD-3 or CARD-4 genes can be mapped to 
chromosomes by preparing PCR primers (preferably 15-25 bp 
in length) from the CARD-3 or CARD-4 sequences. Computer 
analysis of CARD-3 or CARD-4 sequences can be used to 

15 rapidly select primers that do not span more than one 
exon in the genomic DNA, thus complicating the 
amplification process. These primers can then be used 
for PCR screening of somatic cell hybrids containing 
individual human chromosomes . Only those hybrids 

2 0 containing the human gene corresponding to the CARD-3 or 
CARD-4 sequences will yield an amplified fragment. For 
example, in Example 6, human CARD-4 -specific PCR primers 
were used to screen DNAs from a somatic cell hybrid panel 
showing that human CARD-4 maps to chromosome 7 close to 

25 the SHGC-3192 8 genetic marker. 

Somatic cell hybrids are prepared by fusing 
somatic cells from different mammals (e.g., human and 
mouse cells) . As hybrids of human and mouse cells grow 
and divide, they gradually lose human chromosomes in 

30 random order, but retain the mouse chromosomes. By using 
media in which mouse cells cannot grow, because they lack 
a particular enzyme, but human cells can, the one human 
chromosome that contains the gene encoding the needed 
enzyme, will be retained. By using various media, panels 

35 of hybrid cell lines can be established. Each cell line 
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in a panel contains either a single human chromosome or a 
small number of human chromosomes, and a full set of 
mouse chromosomes, allowing easy mapping of individual 
genes to specific human chromosomes. (D'Eustachio et al . 
5 (1983) Science 220:919-924). Somatic cell hybrids 

containing only fragments of human chromosomes can also 
be produced by using human chromosomes with 
translocations and deletions. 

PCR mapping of somatic cell hybrids is a rapid 

10 procedure for assigning a particular sequence to a 

particular chromosome. Three or more sequences can be 
assigned per day using a single thermal cycler. Using 
the CARD-3 or CARD-4 sequences to design oligonucleotide 
primers, sublocalization can be achieved with panels of 

15 fragments from specific chromosomes. Other mapping 

strategies which can similarly be used to map a CARD-3 or 
CARD-4 sequence to its chromosome include in situ 
hybridization (described in Fan et al . (1990) Proc. Natl. 
Acad. Sci . USA 87:6223-27), pre - screening with labeled 

20 flow-sorted chromosomes, and pre-selection by 

hybridization to chromosome specific cDNA libraries. 

Fluorescence in situ hybridization (FISH) of a DNA 
sequence to a metaphase chromosomal spread can further be 
used to provide a precise chromosomal location in one 

2 5 step. Chromosome spreads can be made using cells whose 

division has been blocked in metaphase by a chemical like 
colcemid that disrupts the mitotic spindle. The 
chromosomes can be treated briefly with trypsin, and then 
stained with Giemsa. A pattern of light and dark bands 

3 0 develops on each chromosome, so that the chromosomes can 

be identified individually. The FISH technique can be 
used with a DNA sequence as short as 500 or 600 bases. 
However, clones larger than 1,000 bases have a higher 
likelihood of binding to a unique chromosomal location 
35 with sufficient signal intensity for simple detection. 
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Preferably 1,000 bases, and more preferably 2,000 bases 
will suffice to get good results at a reasonable amount 
of time. For a review of this technique, see Verma et 
al . , (Human Chromosomes: A Manual of Basic Techniques 
5 (Pergamon Press, New York, 1988)). 

Reagents for chromosome mapping can be used 
individually to mark a single chromosome or a single site 
on that chromosome, or panels of reagents can be used for 
marking multiple sites and/or multiple chromosomes. 

10 Reagents corresponding to noncoding regions of the genes 
actually are preferred for mapping purposes. Coding 
sequences are more likely to be conserved within gene 
families, thus increasing the chance of cross 
hybridizations during chromosomal mapping. 

15 Once a sequence has been mapped to a precise 

chromosomal location, the physical position of the 
sequence on the chromosome can be correlated with genetic 
map data. (Such data are found, for example, in V. 
McKusick, Mendelian Inheritance in Man, available on-line 

2 0 through Johns Hopkins University Welch Medical Library) . 
The relationship between genes and disease, mapped to the 
same chromosomal region, can then be identified through 
linkage analysis (co- inheritance of physically adjacent 
genes), described in, e.g., Egeland et al . (1987) Nature, 

25 325:783-787. 

Moreover, differences in the DNA sequences between 
individuals affected and unaffected with a disease 
associated with the CARD-3 or CARD-4 gene can be 
determined. If a mutation is observed in some or all of 

30 the affected individuals but not in any unaffected 
individuals, then the mutation is likely to be the 
causative agent of the particular disease. Comparison of 
affected and unaffected individuals generally involves 
first looking for structural alterations in the 

35 chromosomes such as deletions or translocations that are 
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visible from chromosome spreads or detectable using PCR 
based on that DNA sequence. Ultimately, complete 
sequencing of genes from several individuals can be 
performed to confirm the presence of a mutation and to 
5 distinguish mutations from polymorphisms. 
2. Tissue Typing 

The CARD- 3 or CARD-4 sequences of the present 
invention can also be used to identify individuals from 
minute biological samples. The United States military, 

10 for example, is considering the use of restriction 

fragment length polymorphism (RFLP) for identification of 
its personnel. In this technique, an individual's 
genomic DNA is digested with one or more restriction 
enzymes, and probed on a Southern blot to yield unique 

15 bands for identification. This method does not suffer 
from the current limitations of "Dog Tags" which can be 
lost, switched, or stolen, making positive identification 
difficult. The sequences of the present invention are 
useful as additional DNA markers for RFLP (described in 

20 U.S. Patent 5,272,057). 

Furthermore, the sequences of the present 
invention can be used to provide an alternative technique 
which determines the actual base-by-base DNA sequence of 
selected portions of an individual's genome. Thus, the 

25 CARD-3 or CARD-4 sequences described herein can be used 
to prepare two PCR primers from the 5' and 3' ends of the 
sequences. These primers can then be used to amplify an 
individual's DNA and subsequently sequence it. 

Panels of corresponding DNA sequences from 

3 0 individuals, prepared in this manner, can provide unique 
individual identifications, as each individual will have 
a unique set of such DNA sequences due to allelic 
differences. The sequences of the present invention can 
be used to obtain such identification sequences from 

35 individuals and from tissue. The CARD-3 or CARD-4 
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sequences of the invention uniquely represent portions of 
the human genome. Allelic variation occurs to some 
degree in the coding regions of these sequences, and to a 
greater degree in the noncoding regions. It is estimated 
5 that allelic variation between individual humans occurs 
with a frequency of about once per each 500 bases. Each 
of the sequences described herein can, to some degree, be 
used as a standard against which DNA from an individual 
can be compared for identification purposes. Because 

10 greater numbers of polymorphisms occur in the noncoding 
regions, fewer sequences are necessary to differentiate 
individuals. The noncoding sequences of SEQ ID NO:l, SEQ 
ID NO: 7, SEQ ID NO: 25 and SEQ ID NO: 42 can comfortably 
provide positive individual identification with a panel 

15 of perhaps 10 to 1,000 primers which each yield a 

noncoding amplified sequence of 100 bases. If predicted 
coding sequences, such as those in SEQ ID NO: 3, SEQ ID 
NO: 9, and SEQ ID NO: 27 are used, a more appropriate 
number of primers for positive individual identification 

2 0 would be 500-2,000. 

If a panel of reagents from CARD-3 or CARD-4 
sequences described herein is used to generate a unique 
identification database for an individual, those same 
reagents can later be used to identify tissue from that 

25 individual. Using the unique identification database, 
positive identification of the individual, living or 
dead, can be made from extremely small tissue samples. 

3. Use of Partial CARD-3 or CARD-4 Sequences in 
Forensic Biology 

30 DNA-based identification techniques can also be 

used in forensic biology. Forensic biology is a 
scientific field employing genetic typing of biological 
evidence found at a crime scene as a means for positively 
identifying, for example, a perpetrator of a crime. To 

35 make such an identification, PCR technology can be used 
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to amplify DNA sequences taken from very small biological 
samples such as tissues, e.g., hair or skin, or body 
fluids, e.g., blood, saliva, or semen found at a crime 
scene. The amplified sequence can then be compared to a 
5 standard, thereby allowing identification of the origin 
of the biological sample. 

The sequences of the present invention can be used 
to provide polynucleotide reagents, e.g., PCR primers, 
targeted to specific loci in the human genome, which can 

10 enhance the reliability of DNA-based forensic 

identifications by, for example, providing another 
" identification marker" (i.e. another DNA sequence that 
is unique to a particular individual) . As mentioned 
above, actual base sequence information can be used for 

15 identification as an accurate alternative to patterns 
formed by restriction enzyme generated fragments. 
Sequences targeted to noncoding regions of SEQ ID NO : 1 , 
SEQ ID NO: 7, and SEQ ID NO: 25 are particularly 
appropriate for this use as greater numbers of 

2 0 polymorphisms occur in the noncoding regions, making it 
easier to differentiate individuals using this technique. 
Examples of polynucleotide reagents include the CARD- 3 or 
CARD-4 sequences or portions thereof, e.g., fragments 
derived from the noncoding regions of SEQ ID NO:l, SEQ ID 

25 NO: 7, or SEQ ID NO: 25 which have a length of at least 20 
or 3 0 bases. 

The CARD- 3 or CARD-4 sequences described herein 
can further be used to provide polynucleotide reagents, 
e.g., labeled or labelable probes which can be used in, 

30 for example, an in situ hybridization technique, to 

identify a specific tissue, e.g., brain tissue. This can 
be very useful in cases where a forensic pathologist is 
presented with a tissue of unknown origin. Panels of 
such CARD-3 or CARD-4 probes can be used to identify 

35 tissue by species and/or by organ type. 
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In a similar fashion, these reagents, e.g., CARD-3 
or CARD-4 primers or probes can be used to screen tissue 
culture for contamination (i.e., screen for the presence 
of a mixture of different types of cells in a culture) . 

5 C. Predictive Medicine 

The present invention also pertains to the field 
of predictive medicine in which diagnostic assays, 
prognostic assays, pharmacogenomics, and monitoring 
clinical trials are used for prognostic (predictive) 

10 purposes to thereby treat an individual prophylactically . 
Accordingly, one aspect of the present invention relates 
to diagnostic assays for determining CARD-3 or CARD-4 
protein and/or nucleic acid expression as well as CARD-3 
or CARD-4 activity, in the context of a biological sample 

15 (e.g., blood, serum, cells, tissue) to thereby determine 
whether an individual is afflicted with a disease or 
disorder, or is at risk of developing a disorder, 
associated with aberrant CARD-3 or CARD-4 expression or 
activity. The invention also provides for prognostic (or 

2 0 predictive) assays for determining whether an individual 
is at risk of developing a disorder associated with 
CARD-3 or CARD-4 protein, nucleic acid expression or 
activity. For example, mutations in a CARD-3 or CARD-4 
gene can be assayed in a biological sample. Such assays 

2 5 can be used for prognostic or predictive purpose to 

thereby prophylactically treat an individual prior to the 
onset of a disorder characterized by or associated with 
CARD-3 or CARD-4 protein, nucleic acid expression or 
activity. 

30 Another aspect of the invention provides methods 

for determining CARD-3 or CARD-4 protein, nucleic acid 
expression or CARD-3 or CARD-4 activity in an individual 
to thereby select appropriate therapeutic or prophylactic 
agents for that individual (referred to herein as 
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selection of agents (e.g., drugs) for therapeutic or 
prophylactic treatment of an individual based on the 
genotype of the individual (e.g., the genotype of the 
5 individual examined to determine the ability of the 
individual to respond to a particular agent.) 

Yet another aspect of the invention pertains to 
monitoring the influence of agents (e.g., drugs or other 
compounds) on the expression or activity of CARD- 3 or 
10 CARD-4 in clinical trials. 

These and other agents are described in further 
detail in the following sections. 

1. Diagnostic Assays 

An exemplary method for detecting the presence or 
15 absence of CARD- 3 or CARD-4 in a biological sample 
involves obtaining a biological sample from a test 
subject and contacting the biological sample with a 
compound or an agent capable of detecting CARD- 3 or 
CARD-4 protein or nucleic acid (e.g., mRNA, genomic DNA) 
20 that encodes CARD-3 or CARD-4 protein such that the 
presence of CARD-3 or CARD-4 is detected in the 
biological sample. An agent for detecting CARD-3 or 
CARD-4 mRNA or genomic DNA is a labeled nucleic acid 
probe capable of hybridizing to CARD-3 or CARD-4 mRNA or 

2 5 genomic DNA. The nucleic acid probe can be, for example, 

a full-length CARD-3 or CARD-4 nucleic acid, such as the 
nucleic acid of SEQ ID NO: 1 or 3 , SEQ ID NO: 7 or 9, SEQ 
ID NO: 25 or 27, or a portion thereof, such as an 
oligonucleotide of at least 15, 30, 50, 100, 250 or 500 
30 nucleotides in length and sufficient to specifically 

hybridize under stringent conditions to CARD-3 or CARD-4 
mRNA or genomic DNA, or a human CARD-4 splice variant 
such as the nucleic acid of SEQ ID NO: 38 or SEQ ID NO: 40. 
Other suitable probes for use in the diagnostic assays of 

3 5 the invention are described herein. For example, Example 
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11 describes the use of a nucleic acid probe to detect 
CARD-4 mRNAs in human tissues and cell lines and the 
probe used in this experiment could be used for a 
diagnostic assay. 

An agent for detecting CARD- 3 or CARD-4 protein 
can be an antibody capable of binding to CARD-3 or CARD-4 
protein, preferably an antibody with a detectable label. 
Antibodies can be polyclonal, or more preferably, 
monoclonal. For example, polypeptides corresponding to 
amino acids 128-139 and 287-298 of human CARD - 4 L were 
used to immunize rabbits and produce polyclonal 
antibodies that specifically recognize human CARD-4L. An 
intact antibody, or a fragment thereof (e.g., Fab or 
F(ab')2) can be used. The term "labeled", with regard to 
the probe or antibody, is intended to encompass direct 
labeling of the probe or antibody by coupling (i.e., 
physically linking) a detectable substance to the probe 
or antibody, as well as indirect labeling of the probe or 
antibody by reactivity with another reagent that is 
directly labeled. Examples of indirect labeling include 
detection of a primary antibody using a f luorescently 
labeled secondary antibody and end- labeling of a DNA 
probe with biotin such that it can be detected with 
f luorescently labeled streptavidin. The term "biological 
sample" is intended to include tissues, cells and 
biological fluids isolated from a subject, as well as 
tissues, cells and fluids present within a subject. That 
is, the detection method of the invention can be used to 
detect CARD-3 or CARD-4 mRNA, protein, or genomic DNA in 
a biological sample in vitro as well as in vivo. For 
example, in vitro techniques for detection of CARD-3 or 
CARD-4 mRNA include Northern hybridizations and in situ 
hybridizations. For example, Example 11 contains the use 
of a human CARD-4L nucleic acid probe for a Northern 
blotting analysis of mRNA species encoded by human 
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CARD-4L detected in RNA samples from human tissues and 
cell lines. In vitro techniques for detection of CARD- 3 
or CARD- 4 protein include enzyme linked immunosorbent 
assays (ELISAs) , Western blots, immunoprecipitat ions and 
5 immunofluorescence. In vitro techniques for detection of 
CARD-3 or CARD-4 genomic DNA include Southern 
hybridizations. Furthermore, in vivo techniques for 
detection of CARD-3 or CARD-4 protein include introducing 
into a subject a labeled anti-CARD-3 or CARD-4 antibody. 

10 For example, the antibody can be labeled with a 

radioactive marker whose presence and location in a 
subject can be detected by standard imaging techniques. 

In one embodiment, the biological sample contains 
protein molecules from the test subject. Alternatively, 

15 the biological sample can contain mRNA molecules from the 
test subject or genomic DNA molecules from the test 
subject. An biological sample is a peripheral blood 
leukocyte sample isolated by conventional means from a 
subject. 

2 0 In another embodiment, the methods further involve 

obtaining a control biological sample from a control 
subject, contacting the control sample with a compound or 
agent capable of detecting CARD-3 or CARD-4 protein, 
mRNA, or genomic DNA, such that the presence of CARD-3 or 

2 5 CARD-4 protein, mRNA or genomic DNA is detected in the 

biological sample, and comparing the presence of CARD-3 
or CARD-4 protein, mRNA or genomic DNA in the control 
sample with the presence of CARD-3 or CARD-4 protein, 
mRNA or genomic DNA in the test sample. 

3 0 The invention also encompasses kits for detecting 

the presence of CARD-3 or CARD-4 in a biological sample 
(a test sample) . Such kits can be used to determine if a 
subject is suffering from or is at increased risk of 
developing a disorder associated with aberrant expression 
35 of CARD-3 or CARD-4 (e.g., an immunological disorder). 
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For example, the kit can comprise a labeled compound or 
agent capable of detecting CARD-3 or CARD-4 protein or 
mRNA in a biological sample and means for determining the 
amount of CARD-3 or CARD-4 in the sample (e.g., an 
5 anti -CARD-3 or CARD-4 antibody or an oligonucleotide 

probe which binds to DNA encoding CARD-3 or CARD-4 , e.g., 
SEQ ID NO:l, SEQ ID NO : 3 , SEQ ID NO : 7 , SEQ ID NO : 9 , SEQ 
ID NO: 25 or SEQ ID NO: 27) . Kits may also include 
instruction for observing that the tested subject is 

10 suffering from or is at risk of developing a disorder 
associated with aberrant expression of CARD-3 or CARD-4 
if the amount of CARD-3 or CARD-4 protein or mRNA is 
above or below a normal level . 

For antibody-based kits, the kit may comprise, for 

15 example: (1) a first antibody (e.g., attached to a solid 
support) which binds to CARD-3 or CARD-4 protein; and, 
optionally, (2) a second, different antibody which binds 
to CARD-3 or CARD-4 protein or the first antibody and is 
conjugated to a detectable agent. 

20 For oligonucleotide-based kits, the kit may 

comprise, for example: (1) a oligonucleotide, e.g., a 
detectably labelled oligonucleotide, which hybridizes to 
a CARD-3 or CARD-4 nucleic acid sequence or (2) a pair of 
primers useful for amplifying a CARD-3 or CARD-4 nucleic 

25 acid molecule. 

The kit may also comprise, e.g., a buffering 
agent, a preservative, or a protein stabilizing agent. 
The kit may also comprise components necessary for 
detecting the detectable agent (e.g., an enzyme or a 

30 substrate) . The kit may also contain a control sample or 
a series of control samples which can be assayed and 
compared to the test sample contained. Each component of 
the kit is usually enclosed within an individual 
container and all of the various containers are within a 

35 single package along with instructions for observing 
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whether the tested subject is suffering from or is at 
risk of developing a disorder associated with aberrant 
expression of CARD-3 or CARD-4. 

2 . Prognostic Assays 
5 The methods described herein can furthermore be 

utilized as diagnostic or prognostic assays to identify 
subjects having or at risk of developing a disease or 
disorder associated with aberrant CARD-3 or CARD-4 
expression or activity. For example, the assays 

10 described herein, such as the preceding diagnostic assays 
or the following assays, can be utilized to identify a 
subject having or at risk of developing a disorder 
associated with CARD-3 or CARD-4 protein, nucleic acid 
expression or activity. Alternatively, the prognostic 

15 assays can be utilized to identify a subject having or at 
risk for developing such a disease or disorder. Thus, 
the present invention provides a method in which a test 
sample is obtained from a subject and CARD-3 or CARD-4 
protein or nucleic acid (e.g., mRNA, genomic DNA) is 

20 detected, wherein the presence of CARD-3 or CARD-4 
protein or nucleic acid is diagnostic for a subject 
having or at risk of developing a disease or disorder 
associated with aberrant CARD-3 or CARD-4 expression or 
activity. As used herein, a "test sample" refers to a 

25 biological sample obtained from a subject of interest. 
For example, a test sample can be a biological fluid 
(e.g., serum), cell sample, or tissue. Furthermore, 
the prognostic assays described herein can be used to 
determine whether a subject can be administered an agent 

30 (e.g., an agonist, antagonist, peptidomimetic, protein, 
peptide, nucleic acid, small molecule, or other drug 
candidate) to treat a disease or disorder associated with 
aberrant CARD-3 or CARD-4 expression or activity. For 
example, such methods can be used to determine whether a 

35 subject can be effectively treated with a specific agent 
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or class of agents (e.g., agents of a type which decrease 
CARD-3 or CARD-4 activity) . Thus, the present invention 
provides methods for determining whether a subject can be 
effectively treated with an agent for a disorder 
5 associated with aberrant CARD-3 or CARD-4 expression or 
activity in which a test sample is obtained and CARD-3 or 
CARD-4 protein or nucleic acid is detected (e.g., wherein 
the presence of CARD-3 or CARD-4 protein or nucleic acid 
is diagnostic for a subject that can be administered the 

10 agent to treat a disorder associated with aberrant CARD - 3 
or CARD-4 expression or activity) . 

The methods of the invention can also be used to 
detect genetic lesions or mutations in a CARD-3 or CARD-4 
gene, thereby determining if a subject with the lesioned 

15 gene is at risk for a disorder characterized by aberrant 
cell proliferation and/or differentiation. in preferred 
embodiments, the methods include detecting, in a sample 
of cells from the subject, the presence or absence of a 
genetic lesion characterized by at least one of an 

20 alteration affecting the integrity of a gene encoding a 
CARD-3 or CARD-4 -protein, or the mis-expression of the 
CARD-3 or CARD-4 gene. For example, such genetic lesions 
can be detected by ascertaining the existence of at least 
one of 1) a deletion of one or more nucleotides from a 

25 CARD-3 or CARD-4 gene; 2) an addition of one or more 

nucleotides to a CARD-3 or CARD-4 gene; 3) a substitution 
of one or more nucleotides of a CARD-3 or CARD-4 gene, 4) 
a chromosomal rearrangement of a CARD-3 or CARD-4 gene; 
5) an alteration in the level of a messenger RNA 

3 0 transcript of a CARD-3 or CARD-4 gene, 6) aberrant 

modification of a CARD-3 or CARD-4 gene, such as of the 
methylation pattern of the genomic DNA, 7) the presence 
of a non-wild type splicing pattern of a messenger RNA 
transcript of a CARD-3 or CARD-4 gene (e.g, caused by a 

35 mutation in a splice donor or splice acceptor site) , 8) a 
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non-wild type level of a CARD-3 or CARD- 4 -protein, 9) 
allelic loss of a CARD-3 or CARD-4 gene, and 10) 
inappropriate post-translational modification of a CARD-3 
or CARD-4 -protein . As described herein, there are a 
5 large number of assay techniques known in the art which 
can be used for detecting lesions in a CARD-3 or CARD-4 
gene. A biological sample is a peripheral blood 
leukocyte sample isolated by conventional means from a 
sub j ect . 

10 In certain embodiments, detection of the lesion 

involves the use of a probe/primer in a polymerase chain 
reaction (PCR) (see, e.g., U.S. Patent Nos . 4,683,195 and 
4,683,202), such as anchor PCR or RACE PCR, or, 
alternatively, in a ligation chain reaction (LCR) (see, 

15 e.g., Landegran et al . (1988) Science 241:1077-1080; and 
Nakazawa et al . (1994) Proc . Natl. Acad. Sci . USA 
91:360-364), the latter of which can be particularly 
useful for detecting point mutations in the CARD-3 or 
CARD-4-gene (see, e.g., Abravaya et al . (1995) Nucleic 

20 Acids Res. 23:675-682). This method can include the 
steps of collecting a sample of cells from a patient, 
isolating nucleic acid (e.g., genomic, mRNA or both) from 
the cells of the sample, contacting the nucleic acid 
sample with one or more primers which specifically 

2 5 hybridize to a CARD-3 or CARD-4 gene under conditions 
such that hybridization and amplification of the CARD-3 
or CARD-4-gene (if present) occurs, and detecting the 
presence or absence of an amplification product, or 
detecting the size of the amplification product and 

30 comparing the length to a control sample. It is 

anticipated that PCR and/or LCR may be desirable to use 
as a preliminary amplification step in conjunction with 
any of the techniques used for detecting mutations 
described herein. 
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Alternative amplification methods include: self 
sustained sequence replication (Guatelli et al . (1990) 
Proc. Natl. Acad. Sci . USA 87:1874-1878), transcriptional 
amplification system (Kwoh, et al . (1989) Proc. Natl. 
5 Acad. Sci. USA 86:1173-1177), Q-Beta Replicase (Lizardi 
et al . (1988) Bio/Technology 6:1197), or any other 
nucleic acid amplification method, followed by the 
detection of the amplified molecules using techniques 
well known to those of skill in the art. These detection 

10 schemes are especially useful for the detection of 

nucleic acid molecules if such molecules are present in 
very low numbers. 

In an alternative embodiment, mutations in a 
CARD- 3 or CARD-4 gene from a sample cell can be 

15 identified by alterations in restriction enzyme cleavage 
patterns. For example, sample and control DNA is 
isolated, amplified (optionally) , digested with one or 
more restriction endonucleases , and fragment length sizes 
are determined by gel electrophoresis and compared. 

20 Differences in fragment length sizes between sample and 
control DNA indicates mutations in the sample DNA. 
Moreover, the use of sequence specific ribozymes (see, 
e.g., U.S. Patent No. 5,498,531) can be used to score for 
the presence of specific mutations by development or loss 

25 of a ribozyme cleavage site. 

In other embodiments, genetic mutations in CARD-3 
or CARD-4 can be identified by hybridizing a sample and 
control nucleic acids, e.g., DNA or RNA, to high density 
arrays containing hundreds or thousands of 

30 oligonucleotides probes (Cronin et al . (1996) Human 

Mutation 7:244-255; Kozal et al . (1996) Nature Medicine 
2:753-759). For example, genetic mutations in CARD-3 or 
CARD-4 can be identified in two-dimensional arrays 
containing light-generated DNA probes as described in 

35 Cronin et al . supra. Briefly, a first hybridization 
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array of probes can be used to scan through long 
stretches of DNA in a sample and control to identify base 
changes between the sequences by making linear arrays of 
sequential overlapping probes. This step allows the 
5 identification of point mutations. This step is followed 
by a second hybridization array that allows the 
characterization of specific mutations by using smaller, 
specialized probe arrays complementary to all variants or 
mutations detected. Each mutation array is composed of 

10 parallel probe sets, one complementary to the wild-type 
gene and the other complementary to the mutant gene. 

In yet another embodiment, any of a variety of 
sequencing reactions known in the art can be used to 
directly sequence the CARD-3 or CARD-4 gene and detect 

15 mutations by comparing the sequence of the sample CARD-3 
or CARD-4 with the corresponding wild-type (control) 
sequence. Examples of sequencing reactions include those 
based on techniques developed by Maxam and Gilbert 
((1977) Proc. Natl. Acad. Sci . USA 74:560) or Sanger 

20 ((1977) Proc. Natl. Acad. Sci. USA 74:5463). It is also 
contemplated that any of a variety of automated 
sequencing procedures can be utilized when performing the 
diagnostic assays ((1995) Bio/Techniques 19:448), 
including sequencing by mass spectrometry (see, e.g., PCT 

25 Publication No. WO 94/16101; Cohen et al . (1996) Adv. 
Chromatogr. 36:127-162; and Griffin et al . (1993) Appl . 
Biochem. Biotechnol . 38:147-159). 

Other methods for detecting mutations in the 
CARD-3 or CARD-4 gene include methods in which protection 

3 0 from cleavage agents is used to detect mismatched bases 
in RNA/RNA or RNA/DNA heteroduplexes (Myers et al . (1985) 
Science 230:1242). In general, the art technique of 
"mismatch cleavage" starts by providing heteroduplexes of 
formed by hybridizing (labeled) RNA or DNA containing the 

35 wild-type CARD-3 or CARD-4 sequence with potentially 



WO 99/40102 



PCT/US99/02544 



- 108 - 

mutant RNA or DNA obtained from a tissue sample. The 
double -stranded duplexes are treated with an agent which 
cleaves single-stranded regions of the duplex such as 
which will exist due to basepair mismatches between the 
control and sample strands. For instance, RNA /DNA 
duplexes can be treated with RNase and DNA /DNA hybrids 
treated with SI nuclease to enzymatically digesting the 
mismatched regions. In other embodiments, either DNA /DNA 
or RNA /DNA duplexes can be treated with hydroxyl amine or 
osmium tetroxide and with piperidine in order to digest 
mismatched regions. After digestion of the mismatched 
regions, the resulting material is then separated by size 
on denaturing polyacryl amide gels to determine the site 
of mutation. See, e.g., Cotton et al (1988) Proc . Natl 
Acad Sci USA 85:4397; Saleeba et al (1992) Methods 
Enzymol . 217:286-295. In an embodiment, the control DNA 
or RNA can be labeled for detection. 

In still another embodiment, the mismatch cleavage 
reaction employs one or more proteins that recognize 
mismatched base pairs in double -stranded DNA (so called 
"DNA mismatch repair" enzymes) in defined systems for 
detecting and mapping point mutations in CARD- 3 or CARD -4 
cDNAs obtained from samples of cells. For example, the 
mutY enzyme of E. coli cleaves A at G/A mismatches and 
the thymidine DNA glycosylase from HeLa cells cleaves T 
at G/T mismatches (Hsu et al . (1994) Carcinogenesis 
15:1657-1662). According to an exemplary embodiment, a 
probe based on a CARD-3 or CARD-4 sequence, e.g., a 
wild-type CARD-3 or CARD-4 sequence, is hybridized to a 
cDNA or other DNA product from a test cell(s) . The 
duplex is treated with a DNA mismatch repair enzyme, and 
the cleavage products, if any, can be detected from 
electrophoresis protocols or the like. See, e.g., U.S. 
Patent No. 5,459,039. 
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In other embodiments, alterations in 
electrophoretic mobility will be used to identify 
mutations in CARD-3 or CARD-4 genes. For example, single 
strand conformation polymorphism (SSCP) may be used to 
5 detect differences in electrophoretic mobility between 
mutant and wild type nucleic acids (Orita et al . (1989) 
Proc Natl. Acad. Sci USA: 86:2766, see also Cotton (1993) 
Mutat . Res. 285:125-144; and Hayashi (1992) Genet Anal 
Tech Appl 9:73-79). Single - stranded DNA fragments of 

10 sample and control CARD-3 or CARD-4 nucleic acids will be 
denatured and allowed to renature . The secondary 
structure of single-stranded nucleic acids varies 
according to sequence, the resulting alteration in 
electrophoretic mobility enables the detection of even a 

15 single base change. The DNA fragments may be labeled or 
detected with labeled probes. The sensitivity of the 
assay may be enhanced by using RNA (rather than DNA) , in 
which the secondary structure is more sensitive to a 
change in sequence. In an embodiment, the subject method 

20 utilizes heteroduplex analysis to separate double 

stranded heteroduplex molecules on the basis of changes 
in electrophoretic mobility (Keen et al . (1991) Trends 
Genet 7:5). 

In yet another embodiment, the movement of mutant 
25 or wild-type fragments in polyacryl amide gels containing 
a gradient of denaturant is assayed using denaturing 
gradient gel electrophoresis (DGGE) (Myers et al . (1985) 
Nature 313:495) . When DGGE is used as the method of 
analysis, DNA will be modified to insure that it does not 
3 0 completely denature, for example by adding a GC clamp of 
approximately 4 0 bp of high-melting GC-rich DNA by PCR . 
In a further embodiment, a temperature gradient is used 
in place of a denaturing gradient to identify differences 
in the mobility of control and sample DNA (Rosenbaum and 
35 Reissner (1987) Biophys Chem 265:12753). 
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Examples of other techniques for detecting point 
mutations include, but are not limited to, selective 
oligonucleotide hybridization, selective amplification, 
or selective primer extension. For example, 
5 oligonucleotide primers may be prepared in which the 

known mutation is placed centrally and then hybridized to 
target DNA under conditions which permit hybridization 
only if a perfect match is found (Saiki et al . (1986) 
Nature 324:163); Saiki et al . (1989) Proc . Natl Acad. Sci 

10 USA 86:6230). Such allele specific oligonucleotides are 
hybridized to PCR amplified target DNA or a number of 
different mutations when the oligonucleotides are 
attached to the hybridizing membrane and hybridized with 
labeled target DNA. 

15 Alternatively, allele specific amplification 

technology which depends on selective PCR amplification 
may be used in conjunction with the instant invention. 
Oligonucleotides used as primers for specific 
amplification may carry the mutation of interest in the 

20 center of the molecule (so that amplification depends on 
differential hybridization) (Gibbs et al . (1989) Nucleic 
Acids Res. 17:2437-2448) or at the extreme 3' end of one 
primer where, under appropriate conditions, mismatch can 
prevent, or reduce polymerase extension (Prossner (1993) 

25 Tibtech 11:238). In addition, it may be desirable to 
introduce a novel restriction site in the region of the 
mutation to create cleavage-based detection (Gasparini et 
al . (1992) Mol. Cell Probes 6:1). It is anticipated that 
in certain embodiments amplification may also be 

30 performed using Taq ligase for amplification (Barany 

(1991) Proc. Natl. Acad. Sci USA 88:189). In such cases, 
ligation will occur only if there is a perfect match at 
the 3' end of the 5' sequence making it possible to 
detect the presence of a known mutation at a specific 
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site by looking for the presence or absence of 
amplification . 

The methods described herein may be performed, for 
example, by utilizing pre-packaged diagnostic kits 
5 comprising at least one probe nucleic acid or antibody 
reagent described herein, which may be conveniently used, 
e.g., in clinical settings to diagnose patients 
exhibiting symptoms or family history of a disease or 
illness involving a CARD-3 or CARD-4 gene. 

10 Furthermore, any cell type or tissue, preferably 

peripheral blood leukocytes, in which CARD-3 or CARD-4 is 
expressed may be utilized in the prognostic assays 
described herein. 

3 . Pharmacogenomics 

15 Agents, or modulators which have a stimulatory or 

inhibitory effect on CARD-3 or CARD-4 activity (e.g., 
CARD-3 or CARD-4 gene expression) as identified by a 
screening assay described herein can be administered to 
individuals to treat (prophylact ically or 

20 therapeutically) disorders (e.g., an immunological 
disorder) associated with aberrant CARD-3 or CARD-4 
activity. In conjunction with such treatment, the 
pharmacogenomics (i.e., the study of the relationship 
between an individual ' s genotype and that individual ' s 

2 5 response to a foreign compound or drug) of the individual 

may be considered. Differences in metabolism of 
therapeutics can lead to severe toxicity or therapeutic 
failure by altering the relation between dose and blood 
concentration of the pharmacologically active drug. Thus, 

3 0 the pharmacogenomics of the individual permits the 

selection of effective agents (e.g., drugs) for 
prophylactic or therapeutic treatments based on a 
consideration of the individual's genotype. Such 
pharmacogenomics can further be used to determine 
35 appropriate dosages and therapeutic regimens. 
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Accordingly, the activity of CARD-3 or CARD-4 protein, 
expression of CARD-3 or CARD-4 nucleic acid, or mutation 
content of CARD-3 or CARD-4 genes in an individual can be 
determined to thereby select appropriate agent (s) for 
5 therapeutic or prophylactic treatment of the individual. 

Pharmacogenomics deals with clinically significant 
hereditary variations in the response to drugs due to 
altered drug disposition and abnormal action in affected 
persons. See, e.g., Linder (1997) Clin. Chem. 

10 43 (2 ): 254 -266 . In general, two types of pharmacogenetic 
conditions can be differentiated. Genetic conditions 
transmitted as a single factor altering the way drugs act 
on the body (altered drug action) or genetic conditions 
transmitted as single factors altering the way the body 

15 acts on drugs (altered drug metabolism) . These 

pharmacogenetic conditions can occur either as rare 
defects or as polymorphisms. For example, 

glucose- 6 -phosphate dehydrogenase deficiency (G6PD) is a 
common inherited enzymopathy in which the main clinical 

2 0 complication is haemolysis after ingestion of oxidant 
drugs (ant i-malarials , sulfonamides, analgesics, 
nitrofurans) and consumption of fava beans. 

As an illustrative embodiment, the activity of 
drug metabolizing enzymes is a major determinant of both 

25 the intensity and duration of drug action. The discovery 
of genetic polymorphisms of drug metabolizing enzymes 
(e.g., N-acetyltransf erase 2 (NAT 2) and cytochrome P450 
enzymes CYP2D6 and CYP2C19) has provided an explanation 
as to why some patients do not obtain the expected drug 

30 effects or show exaggerated drug response and serious 
toxicity after taking the standard and safe dose of a 
drug. These polymorphisms are expressed in two 
phenotypes in the population, the extensive metabolizer 
(EM) and poor metabolizer (PM) . The prevalence of PM is 

35 different among different populations. For example, the 
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gene coding for CYP2D6 is highly polymorphic and several 
mutations have been identified in PM, which all lead to 
the absence of functional CYP2D6 . Poor metabolizers of 
CYP2D6 and CYP2C19 quite frequently experience 
5 exaggerated drug response and side effects when they 
receive standard doses. If a metabolite is the active 
therapeutic moiety, PM show no therapeutic response, as 
demonstrated for the analgesic effect of codeine mediated 
by its CYP2D6- formed metabolite morphine. The other 

10 extreme are the so called ultra-rapid metabolizers who do 
not respond to standard doses. Recently, the molecular 
basis of ultra-rapid metabolism has been identified to be 
due to CYP2D6 gene amplification. 

Thus, the activity of CARD-3 or CARD-4 protein, 

15 expression of CARD- 3 or CARD-4 nucleic acid, or mutation 
content of CARD- 3 or CARD-4 genes in an individual can be 
determined to thereby select appropriate agent (s) for 
therapeutic or prophylactic treatment of the individual. 
In addition, pharmacogenet ic studies can be used to apply 

2 0 genotyping of polymorphic alleles encoding 

drug-metabolizing enzymes to the identification of an 
individual's drug responsiveness phenotype . This 
knowledge, when applied to dosing or drug selection, can 
avoid adverse reactions or therapeutic failure and thus 

2 5 enhance therapeutic or prophylactic efficiency when 
treating a subject with a CARD-3 or CARD-4 modulator, 
such as a modulator identified by one of the exemplary 
screening assays described herein. 

4. Monitoring of Effects During Clinical Trials 

30 Monitoring the influence of agents (e.g., drugs, 

compounds) on the expression or activity of CARD-3 or 
CARD-4 (e.g., the ability to modulate aberrant cell 
proliferation and/or differentiation) can be applied not 
only in basic drug screening, but also in clinical 

35 trials. For example, the effectiveness of an agent 
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determined by a screening assay as described herein to 
increase CARD- 3 or CARD-4 gene expression, protein 
levels, or upregulate CARD-3 or CARD-4 activity, can be 
monitored in clinical trails of subjects exhibiting 
5 decreased CARD-3 or CARD-4 gene expression, protein 
levels, or downregulated CARD-3 or CARD-4 activity. 
Alternatively, the effectiveness of an agent determined 
by a screening assay to decrease CARD-3 or CARD-4 gene 
expression, protein levels, or downregulated CARD-3 or 

10 CARD-4 activity, can be monitored in clinical trials of 
subjects exhibiting increased CARD-3 or CARD-4 gene 
expression, protein levels, or upregulated CARD-3 or 
CARD-4 activity. In such clinical trials, the expression 
or activity of CARD-3 or CARD-4 and, preferably, other 

15 genes that have been implicated in, for example, a 

cellular proliferation disorder can be used as a "read 
out" or markers of the immune responsiveness of a 
particular cell. 

For example, and not by way of limitation, genes, 

2 0 including CARD-3 or CARD-4, that are modulated in cells 

by treatment with an agent (e.g., compound, drug or small 
molecule) which modulates CARD-3 or CARD-4 activity 
(e.g., identified in a screening assay as described 
herein) can be identified. Thus, to study the effect of 
25 agents on cellular proliferation disorders, for example, 
in a clinical trial, cells can be isolated and RNA 
prepared and analyzed for the levels of expression of 
CARD-3 or CARD-4 and other genes implicated in the 
disorder. The levels of gene expression (i.e., a gene 

3 0 expression pattern) can be quantified by Northern blot 

analysis or RT-PCR, as described herein, or alternatively 
by measuring the amount of protein produced, by one of 
the methods as described herein, or by measuring the 
levels of activity of CARD-3 or CARD-4 or other genes. 
35 In this way, the gene expression pattern can serve as a 
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marker, indicative of the physiological response of the 
cells to the agent. Accordingly, this response state may 
be determined before, and at various points during, 
treatment of the individual with the agent. 
5 In an embodiment, the present invention provides a 

method for monitoring the effectiveness of treatment of a 
subject with an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small 
molecule, or other drug candidate identified by the 

10 screening assays described herein) comprising the steps 
of (i) obtaining a pre-administration sample from a 
subject prior to administration of the agent; (ii) 
detecting the level of expression of a CARD-3 or CARD-4 
protein, mRNA, or genomic DNA in the preadministrat ion 

15 sample; (iii) obtaining one or more post-administration 
samples from the subject; (iv) detecting the level of 
expression or activity of the CARD-3 or CARD-4 protein, 
mRNA , or genomic DNA in the post -administration samples; 
(v) comparing the level of expression or activity of the 

20 CARD-3 or CARD-4 protein, mRNA, or genomic DNA in the 
pre-administration sample with the CARD-3 or CARD-4 
protein, mRNA, or genomic DNA in the post administration 
sample or samples; and (vi) altering the administration 
of the agent to the subject accordingly. For example, 

25 increased administration of the agent may be desirable to 
increase the expression or activity of CARD-3 or CARD-4 
to higher levels than detected, i.e., to increase the 
effectiveness of the agent. Alternatively, decreased 
administration of the agent may be desirable to decrease 

30 expression or activity of CARD-3 or CARD-4 to lower 

levels than detected, i.e., to decrease the effectiveness 
of the agent . 



WO 99/40102 



PCT/US99/02544 



- 116 - 

C. Methods of Treatment 

The present invention provides for both 
prophylactic and therapeutic methods of treating a 
subject at risk of (or susceptible to) a disorder or 
5 having a disorder associated with aberrant CARD- 3 or 
CARD-4 expression or activity. 

1. Prophylactic Methods 

In one aspect, the invention provides a method for 
preventing in a subject, a disease or condition 

10 associated with an aberrant CARD-3 or CARD-4 expression 
or activity, by administering to the subject an agent 
which modulates CARD-3 or CARD-4 expression or at least 
one CARD-3 or CARD-4 activity. Subjects at risk for a 
disease which is caused or contributed to by aberrant 

15 CARD-3 or CARD-4 expression or activity can be identified 
by, for example, any or a combination of diagnostic or 
prognostic assays as described herein. Administration of 
a prophylactic agent can occur prior to the manifestation 
of symptoms characteristic of the CARD-3 or CARD-4 

20 aberrancy, such that a disease or disorder is prevented 
or, alternatively, delayed in its progression. Depending 
on the type of CARD-3 or CARD-4 aberrancy, for example, a 
CARD-3 or CARD-4 agonist or CARD-3 or CARD-4 antagonist 
agent can be used for treating the subject. The 

25 appropriate agent can be determined based on screening 
assays described herein. Activities of CARD-3 or CARD-4 
that could be modulated for prophylactic purposes 
include, but are not limited to, 1) CARD-3 or CARD-4 gene 
or protein expression, for example, see Example 11 for a 

30 description of the mRNA expression pattern of human 

CARD-4; 2) CARD-3 or CARD-4 binding to a target protein, 
for example, see Examples 7, 8, and 12 for a description 
of proteins known to bind to CARD-3 or CARD-4; 3) CARD-4 
regulation of NF-kB as described in Example 9; and 4) 
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CARD- 3 and CARD-4 enhancement of caspase 9 activity as 
described in Example 10. 

2 . Therapeutic Methods 

Another aspect of the invention pertains to 
5 methods of modulating CARD-3 or CARD-4 expression or 

activity for therapeutic purposes. The modulatory method 
of the invention involves contacting a cell with an agent 
that modulates one or more of the activities of CARD-3 or 
CARD-4 protein activity associated with the cell. An 

10 agent that modulates CARD-3 or CARD-4 protein activity 
can be an agent as described herein, such as a nucleic 
acid or a protein, a naturally-occurring cognate ligand 
of a CARD-3 or CARD-4 protein, a peptide, a CARD-3 or 
CARD-4 peptidomimetic, or other small molecule. In one 

15 embodiment, the agent stimulates one or more of the 
biological activities of CARD-3 or CARD-4 protein. 
Examples of such stimulatory agents include active CARD-3 
or CARD-4 protein and a nucleic acid molecule encoding 
CARD-3 or CARD-4 that has been introduced into the cell. 

2 0 In another embodiment, the agent inhibits one or more of 
the biological activities of CARD-3 or CARD-4 protein. 
Examples of such inhibitory agents include antisense 
CARD-3 or CARD-4 nucleic acid molecules and anti -CARD-3 
or CARD-4 antibodies. These modulatory methods can be 

25 performed in vitro (e.g., by culturing the cell with the 
agent) or, alternatively, in vivo (e.g, by administering 
the agent to a subject) . As such, the present invention 
provides methods of treating an individual afflicted with 
a disease or disorder characterized by aberrant 

30 expression or activity of a CARD-3 or CARD-4 protein or 
nucleic acid molecule. In one embodiment, the method 
involves administering an agent (e.g., an agent 
identified by a screening assay described herein) , or 
combination of agents that modulates (e.g., upregulates 

35 or downregulates) CARD-3 or CARD-4 expression or 
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activity. In another embodiment, the method involves 
administering a CARD-3 or CARD-4 protein or nucleic acid 
molecule as therapy to compensate for reduced or aberrant 
CARD-3 or CARD-4 expression or activity. Activities of 
5 CARD-3 or CARD-4 that could be modulated for therapeutic 
purposes include, but are not limited to, 1) CARD-3 or 
CARD-4 gene or protein expression, for example, see 
Example 11 for a description of the mRNA expression 
pattern of human CARD-4; 2) CARD-3 or CARD-4 binding to a 

10 target protein, for example, see Examples 7, 8, and 12 

for a description of proteins known to bind to CARD-3 or 
CARD-4; 3) CARD-4 regulation of NF-kB as described in 
Example 9; and 4) CARD-4 enhancement of caspase 9 
activity as described in Example 10. 

15 Stimulation of CARD-3 or CARD-4 activity is 

desirable in situations in which CARD-3 or CARD-4 is 
abnormally downregulated and/or in which increased CARD-3 
or CARD-4 activity is likely to have a beneficial effect. 
Conversely, inhibition of CARD-3 or CARD-4 activity is 

20 desirable in situations in which CARD-3 or CARD-4 is 

abnormally upregulated, e.g., in myocardial infarction, 
and/or in which decreased CARD-3 or CARD-4 activity is 
likely to have a beneficial effect. Since CARD-4 may 
play be involved in the processing of cytokines, 

25 inhibiting the activity or expression CARD4- may be 

beneficial in patients that have aberrant inflammation. 

This invention is further illustrated by the 
following examples which should not be construed as 
limiting. The contents of all references, patents and 

30 published patent applications cited throughout this 
application are hereby incorporated by reference. 
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EXAMPLES 

Example 1: Isolation and Characterization of 
full- length Human CARD - 3 and CARD-4L/S cDNAs . 

A profile of known CARD domains was used to search 
5 databases of cDNA sequences and partial cDNA sequences 
using TBLASTN (Washington University; version 2.0, 
BLOSUM62 search matix) . This search led to the 
identification of CARD- 3 . Using CARD-3 to search 

databases of cDNA sequences and partial cDNA sequences, 

10 another potential CARD cDNA was found. This cDNA 
sequence was used screen a human umbilical vein 
endothelial library (HUVE) and a clone containing the 
partial CARD-4S was identified. The human umbilical vein 
endothelial library was then rescreened using a probe 

15 designed against the partial CARD-4S sequence and a clone 
containing the CARD-4L sequence was identified. 

Example 2: Characterization of CARD-3 AND 
CARD-4L/S Proteins. 

In this example, the predicted amino acid 

2 0 sequences of human CARD-3 and CARD- 4 L/ S proteins were 
compared to amino acid sequences of known proteins and 
various motifs were identified. For example, the CARD 
domains of CARD-3 and CARD-4 were aligned (Figure 7) with 
the CARD domains of ARC-CARD (SEQ ID N0:31), cIAPl-CARD 

25 (SEQ ID NO:32) and CIAP2-CARD (SEQ ID NO:33) . In 

addition, the molecular weight of the human CARD-3 and 
CARD-4L/S proteins were predicted. 

The human CARD-3 cDNA was isolated as described 
above (Figure 1; SEQ ID N0:1) and encodes a 54 0 amino 

30 acid protein (Figure 2: SEQ ID NO:2). CARD-3 also 

includes one predicted kinase domain (amino acid 1 to 
amino acid 300 of SEQ ID NO:2; SEQ ID NO : 4 ) , which is 
followed by a predicted linker domain (amino acid 301 to 
amino acid 431 of SEQ ID NO: 2; SEQ ID NO: 5) and a 
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predicted CARD domain (amino acid 4 32 to amino acid 54 0 
Of SEQ ID NO: 2; SEQ ID NO: 6). 

The human CARD - 4 L cDNA was isolated as described 
above (Figure 3; SEQ ID NO: 7) and has a 2859 nucleotide 
5 open reading frame (nucleotides 245-3103 of SEQ ID NO : 7 ; 
SEQ ID NO: 9) which encodes a 953 amino acid protein 
(Figure 4; SEQ ID NO: 8) . CARD-4L protein has a predicted 
CARD domain (amino acids 15-114; SEQ ID NO:10). CARD-4L 
is also predicted to have a nucleotide binding domain 

10 which extends from about amino acid 198 to about amino 

acid 3 97 of SEQ ID NO : 8 ; SEQ ID NO: 11, a predicted Walker 
Box "A", which extends from about amino acid 202 to about 
amino acid 209 of SEQ ID NO: 8; SEQ ID NO: 12, a predicted 
Walker Box "B" , which extends from about amino acid 280 

15 to about amino acid 2 84, of SEQ ID NO: 8; SEQ ID NO: 13, a 
predicted kinase la (P-loop) domain, which extends from 
about amino acid 197 to about amino acid 212 of SEQ ID 
NO: 8; SEQ ID NO: 46, a predicted kinase 2 domain, which 
extends from about amino acid 273 to about amino acid 288 

20 of SEQ ID NO: 8; SEQ ID NO: 47, a predicted kinase 3a 
subdomain, which extends from about amino acid 32 7 to 
about amino acid 33 8 of SEQ ID NO: 8; SEQ ID NO: 14, ten 
predicted Leucine -rich repeats which extend from about 
amino acid 674 to about amino acid 950 of SEQ ID NO: 8. 

25 The first Leucine-rich repeat is predicted to extend from 
about amino acid 674 to about amino acid 701 of SEQ ID 
NO: 8; SEQ ID NO: 15. The second Leucine-rich repeat is 
predicted to extend from about amino acid 7 02 to about 
amino acid 727 of SEQ ID NO: 8; SEQ ID NO: 16. The third 

30 Leucine-rich repeat is. predicted to extend from about 
amino acid 728 to about amino acid 754 of SEQ ID NO : 8 ; 
SEQ ID NO: 17. The fourth Leucine-rich repeat is 
predicted to extend from about amino acid 755 to about 
amino acid 782 of SEQ ID NO: 8; SEQ ID NO: 18. The fifth 

3 5 Leucine-rich repeat is predicted to extend from about 
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amino acid 783 to about amino acid 810 of SEQ ID NO : 8 ; 
SEQ ID NO: 19. The sixth Leucine - rich repeat is predicted 
to extend from about amino acid 811 to about amino acid 
83 8 of SEQ ID NO : 8 ; SEQ ID NO : 2 0 . The seventh 
5 Leucine-rich repeat is predicted to extend from about 
amino acid 839 to about amino acid 866 of SEQ ID NO: 8; 
SEQ ID NO: 21. The eighth Leucine-rich repeat is 
predicted to extend from about amino acid 867 to about 
amino acid 894 of SEQ ID NO : 8 ; SEQ ID NO : 2 2 . The ninth 

10 Leucine-rich repeat is predicted to extend from about 
amino acid 895 to about amino acid 922 of SEQ ID NO: 8; 
SEQ ID NO: 23 and the tenth leucine-rich repeat is 
predicted to extend from about amino acid 923 to about 
amino acid 950 of SEQ ID NO: 8; SEQ ID NO: 24. 

15 The human partial CARD - 4 S cDNA isolated as 

described above (Figure 5; SEQ ID NO: 25) encodes a 4 90 
amino acid protein (Figure 6; SEQ ID NO: 26) . CARD - 4 S 
includes one predicted partial CARD domain (amino acids 
1-74 of SEQ ID NO: 26) . CARD - 4 S is also predicted to have 

2 0 a P-Loop which extends from about amino acid 163 to about 
amino acid 170 of SEQ ID NO: 26; SEQ ID NO: 29, and a 
predicted Walker Box "B" which extends form about amino 
acid 241 to about amino acid 245 of SEQ ID NO: 26; SEQ ID 
NO: 30 . 

25 A plot showing the predicted structural features 

of CARD-4L is presented in Figure 8* This figure shows 
the predicted alpha regions (Gamier-Robinson and 
Chou-Fasman) , the predicted beta regions 
(Gamier-Robinson and Chou-Fasman) , the predicted turn 

30 regions (Garnier-Robinson and Chou-Fasman) and the 
predicted coil regions (Garnier-Robinson and 
Chou-Fasman) . Also included in the figure is a 
hydrophilicity plot (Kyte-Doolittle) , the predicted alpha 
and beta-amphatic regions (Eisenberg) , the predicted 

35 flexible regions (Karplus-Schulz) , the predicted 
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antigenic index (Jameson-Wolf) and the predicted surface 
probability plot (Emini) . 

A plot showing the predicted sturctural features 
of CARD- 4 S is also presented in Figure 9. This figure 
5 shows the predicted alpha regions (Garnier-Robinson and 
Chou-Fasman) , the predicted beta regions 
(Gamier-Robinson and Chou-Fasman) , the predicted turn 
regions (Gamier-Robinson and Chou-Fasman) and the 
predicted coil regions (Garnier-Robinson and 

10 Chou-Fasman) . Also included in the figure is a 

hydrophilicity plot (Kyte-Doolittle) , the predicted alpha 
and beta-amphatic regions (Eisenberg) , the predicted 
flexible regions (Karplus-Schulz) , the predicted 
antigenic index (Jameson- Wolf ) and the predicted surface 

15 probability plot (Emini) . 

The predicted MW of CARD- 3 is approximately 61 
kDa. The predicted MW of CARD-4L is approximately 108 
kDa . 

Example 3: Preparation of CARD -3 and CARD -4 Proteins 

2 0 Recombinant CARD -3 and CARD -4 can be produced in a 

variety of expression systems. For example, the CARD- 3 
and CARD -4 peptides can be expressed as a recombinant 
glutathione-S- transferase (GST) fusion protein in E. coli 
and the fusion protein can be isolated and characterized. 
25 Specifically, as described above, CARD-3 or CARD-4 can be 
fused to GST and the fusion protein can be expressed in 
E. coli strain PEB199. Expression of the GST-CARD-3 or 
GST-CARD-4 fusion protein in PEB199 can be induced with 
IPTG. The recombinant fusion protein can be purified 

3 0 from crude bacterial lysates of the induced PEB199 strain 

by affinity chromatography on glutathione beads. 
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Example 4: Identification of splice variants of CARD 4 . 

The 5' untranslated sequence from CARD - 4 L was used 
to search databases of cDNA sequences and partial cDNA 
sequences using BLASTN (Washington University; version 
5 2.0, BLOSUM62 search matrix) for additional CARD-4 cDNA 
clones. This search led to the identification of two 
cDNA clones, clone Z from a human lymph node library and 
the Y clone from a human brain cDNA library. Both clones 
were sequenced and found to represent probable splice 

10 variants of CARD-4 that encode truncated CARD-4 proteins, 
Y encoding a 24 9 amino acid protein and Z encoding a 164 
amino acid protein. Fig. 10 shows the nucleotide (SEQ ID 
NO: 38) and Fig. 11 the predicted amino acid (SEQ ID 
NO: 3 9) sequences of human CARD-4Y; Fig. 12 shows the 

15 nucleotide (SEQ ID NO: 40) and Fig. 13 the amino acid (SEQ 
ID NO: 41) sequences of human CARD - 4 Z ; and Fig. 14 shows 
an alignment of the CARD-4L, CARD- 4 Y , and CARD - 4 Z amino 
acid sequences generated by the Clustal program using a 
PAM250 residue weight table. 

20 Example 5: Identification of murine CARD-4 . 

The CARD-4 polypeptide sequence was used to search 
databases of cDNA sequences and partial cDNA sequences 
using the TBLASTN program (version 1.4, BLOSUM62 search 
matrix, and a word length of 3) for murine CARD-4 cDNA 

25 clones. This search led to the identification of a 

partial murine CARD-4 clone designated murine CARD - 4 L . 
The rapid identification of cDNA ends procedure (RACE) 
was applied to the 5' end of the murine CARD-4L clone to 
elucidate the 5' end of the murine CARD-4L cDNA. Fig. 15 

30 shows the murine CARD - 4 L nucleotide sequence (SEQ ID 
NO:42), Figure 16 shows the murine CARD-4L amino acid 
sequence (SEQ ID NO: 43) , and Fig. 17 shows an alignment 
of the murine CARD-4L and human CARD-4L amino acid 
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sequences generated by the Clustal program using a PAM250 
residue weight table. 



Example 6 : Identification of the chromosomal location 
of human CARD -4 . 

5 To determine the chromosomal location of the human 

CARD-4 gene, the polymerase chain reaction carried out 
with human CARD-4 -specif ic primers card4t, with the 5' to 
3' sequence agaaggtctggtcggcaaa (SEQ ID NO:44), and 
card4k / with the 5' to 3' sequence aagccctgagtggaagca 
10 (SEQ ID NO:45), was used to screen DNAs from a 

commercially available somatic cell hybrid panel. This 
analysis showed that human CARD-4 maps to chromosome 7 
close to the SHGC-31928 genetic marker. 

Example 7: Identification of CARD- 3 in a yeast 
15 two-hybrid screen for proteins that physically interact 
with the CARD domain of human CARD-4. 

DNA encoding amino acids 1-145 of human CARD-4 
comprising the CARD domain was cloned into a yeast 
two-hybrid screening vector to create a CARD-4 , 1 - 145-GAL4 

2 0 DNA-binding domain fusion for two-hybrid screening. The 
CARD-4 , 1-14 5-GAL4 DNA-binding domain fusion was used to 
screen human mammary gland and human prostate two -hybrid 
libraries for gene products that could physically 
associate with CARD-4 , 1-145 . Twelve library plasmids 

25 expressing CARD4, 1-145 interacting proteins were found to 
contain the CARD-domain containing protein CARD- 3 thus 
establishing a direct or indirect physical interaction 
between CARD-4 and CARD- 3 . 

In addition, DNA encoding amino acids 435-540 of 

30 CARD-3 comprising the CARD domain of CARD-3 (SEQ ID NO: 6) 
was cloned into a yeast two-hybrid GAL4 transcriptional 
activation domain fusion vector to create a CARD-3 ,435- 
54 0-GAL4 transcriptional activation domain fusion. To 
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test whether the CARD domain of CARD- 3 binds 
CARD-4 , 1-145, the CARD-3 , 435-540-GAL4 transcriptional 
activation domain fusion expression vector and the 
CARD- 4 , 1-14 5-GAL4 DNA-binding domain fusion vector were 
5 cotransf ormed into a two-hybrid screening Sac char omyces 
cerevisiae (yeast) strain. The resulting cot ransf ormed 
yeast strain expressed the two reporter genes that 
indicate a physical interaction between the two hybrid 
proteins in the experiment, in this case, the CARD-3, 435- 
10 540-GALi4 transcriptional activation domain fusion protein 
and the CARD-4 , 1-145-GAL4 DNA-binding domain fusion 
protein. This experiment established a physical 
interaction between the CARD domain of CARD-3 and the 
CARD domain of CARD-4 . 

15 Example 8: Identification of hNUDC in a yeast 

two-hybrid screen for proteins that physically interact 
with the LRR domain of human CARD-4. 

DNA encoding amino acids 4 06-953 of human CARD - 4 L 
comprising the LRR domain was cloned into a yeast 

2 0 two-hybrid screening vector to create a 

CARD-4 , 406-953-GAL4 DNA-binding domain fusion for 
two-hybrid screening. The CARD-4 , 4 06- 953 -GAL4 
DNA-binding domain fusion was used to screen a human 
mammary gland two-hybrid library for gene products that 

25 could physically associate with CARD-4 , 406 - 953 . One 
library plasmid expressing a CARD4 , 406-953 interacting 
protein was found to contain the hNUDC protein, the human 
ortholog of the rat NUDC protein that has been implicated 
in nuclear movement (Morris et al . , Curr. Biol. 8:603 

30 [1998], Morris et al . , Exp. Cell Res. 238:23 [1998]), 
thus establishing a physical interaction between CARD-4 
and hNUDC. 
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Example 9: Discovery of regulation by CARD-4 of NF-kB. 

The first group of experiments described in this 
Example were carried out to determine if CARD- 4 can 
activate the NF-kB pathway. CARD-4 regulation of the 
5 NF-kB pathway is of interest because the NF-kB pathway is 
involved in many diseases described in (New England 
Journal of Medicine 336:1066 [1997]) and (American 
Journal of Cardiology 76:18C [1995]) and other references 
known to those skilled in the art. Participation of 
10 CARD-4 in the NF-kB pathway would make CARD-4 an 

attractive target for drugs that modulate the NF-kB 
pathway for treatment of NF-kB pathway-dependent 
diseases, conditions, and biological processes. 

The first group of experiments showed specific 
15 CARD-4 -mediated NF-kB pathway induction. 

The second group of experiments described in this 
Example were carried out to determine if CARD-3, the NIK 
serine/threonine protein kinase (Su et al., EMBO J. 
16:1279 [1997]), or the signal transduction protein TRAF6 
20 (Cao et al . , Nature 383:443 [1996]), proteins known to 
participate in the induction of NF-kB (McCarthy et al . , 
J. Biol. Chem. 273:16968 [1998]), are involved in 
transducing the CARD- 4 -dependent NF-kB pathway induction 
signal. It was found that CARD-3, NIK, and TRAF6 are all 
2 5 involved in transducing the CARD-4 -mediated NF-kB pathway 
induction signal. 

In nine transfection experiments, 293T cells 
coexpressing an NF-acB reporter plasmid and either pCI, 
pCI -CARD-4L (expressing CARD-4L) , pCI-CARD-4S (expressing 
30 CARD-4S) , pCI-APAFL (expressing Apaf- 1 ) , pCI-APAFS 
(expressing an Apaf-1 variant lacking WD repeats) , 
pCI-CARD-4LnoCARD (expressing CARD-4L without a CARD 
domain) , pC I - CARD4 LnoLRR (expressing CARD-4L without a 
L.RR) , pCI-CARD4LCARDonly (expressing CARD-4L CARD domain 
35 only) , or pCI -CARD4NBSonly (expressing CARD-4L nucleotide 
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binding sequence only) were created. 293T cells cells 
were plated in 6-well plates (35 mm wells) and 
transfected 2 days later (90% confluency) with 1 ug of 
NF-kB luciferase reporter plasmid (pNF- /cB-Luc , 
Stratagene) , 200 ng of pCMV £-gal, 600 ng of pCI vector 
and 200 ng of indicated expression plasmids using 
SuperFect transfection reagent (Qiagen) . For dominant- 
negative experiments, 2 ng of CARD 4 expressing plasmid 
and 800 ng of dominant -negative plasmid were used. Cells 
were harvested 48 h after transfection and' luciferase 
activity in 1000-fold diluted cell extracts was 
determined using the Luciferase Assay System (Promega) . 
In addition, S-galactosidase activities were determined 
and used to normalize transfection efficiency. 

Relative luciferase activity was determined at the 
end of the experiment to assess NF-kB pathway activation 
by the gene expressed by the pCI -based plasmid in each 
transfected cell line. The cell lines containing pCI , 
pCI-APAFS, pCI-APAFL, pCI -CARD- 4LnoCARD , and pCI- 
CARD4NBSonly had similar baseline levels of luciferase 
expression but the cell lines containing pCI-CARD-4L, 
pCI -CARD4LnoLRR, and pC I - CARD4 LCARDonl y had luciferase 
expression about nine fold elevated relative to baseline 
and the cell line containing pCI-CARD4S had luciferase 
expression sixteen fold elevated relative to baseline. 
This result demonstrates induction by CARD - 4 S and CARD- 4 L 
of the NF-kB pathway. This CARD-4 mediated NF-kB pathway 
induction is dependent on the CARD-4 CARD domain because 
the pCI -CARD-4noCARD construct expressing CARD-4 lacking 
its CARD domain did not induce the luciferase reporter 
gene and pCI -CARD4LCARDonly expressing the CARD-4 CARD 
domain did induce the luciferase reporter gene. Also, 
the CARD-4 LRR domains are not required for NF-/cB pathway 
activation because pCI - CARD4LnoLRR expressing a CARD-4 
mutant protein lacking LRR domains is able to induce the 
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lucif erase reporter gene. In addition, the CARD-4 NBS 
domain is not sufficient for NF-kB pathway activation 
because pCI-CARD4NBSonly expressing CARD-4 NBS domain is 
not able to induce the lucif erase reporter gene. In 
addition, the induction of the NF-kB pathway by CARD-4 is 
specific, as neither Apaf -expressing construct in this 
experiment induced lucif erase activation. 

In five transfection experiments, 293T cells 
coexpressing an NF-/cB reporter plasmid (NF- kB- lucif erase , 
Stratagene) and pCI-CARD-4L and either, no vector, 
pCI-TRAF6-DN (expressing a dominant negative version of 
TRAF-6) , pCI-NIK-DN (expressing a dominant negative 
version of NIK kinase) , pC I -CARD3 CARDonly (expressing the 
CARD domain of CARD-3, which acts as a dominant negative 
version of CARD-3) , or pCI-Bcl-XL (expressing the anti- 
apoptotic protein Bel -XL) were created. TRAF6-DN, NIK- 
DN, and CARD3 - CARDonly are dominant negative alleles of 
the TRAF6 , NIK, and CARD 3 genes, respectively. After 48 
hours, cells were lysed and the relative lucif erase 
activity was determined (Promega Kit) to assess NF-eB 
pathway activation by the genes expressed by the one or 
two pCI -based plasmids in each transfected cell line. The 
cell lines containing pCI -CARD-4L only or pCI -CARD-4L and 
pCI-Bcl-XL had relative lucif erase reporter gene 
expression of about 18 units. The cell lines containing 
pCI-CARD-4L and pCI-TRAF6-DN, pCI-CARD-4L and pCI-NIK-DN, 
or pCI - CARD - 4 L and pCI -CARD 3 CARDonly had relative 
lucif erase reporter gene expression of about 4 units. 
Inhibition of CARD-4L-mediated NF-kB pathway induction by 
TRAF6-DN, NIK-DN, and CARD- 3 CARDonly is specific as Bcl- 
XL» did not inhibit CARD-4L-mediated NF-kB pathway 
induction. 

These results demonstrate that dominant negative 
alleles of TRAF6, NIK and CARD-3 expressed, respectively, 
from pCI -TRAF6-DN, pCI -NIK-DN, and pCI - CARD 3 CARDonly 
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block induction of the NF-kB reporter gene by CARD - 4 L 
expression (pCI -CARD-4L) and suggest that TRAF6 , NIK, and 
CARD- 3 act downstream of CARD - 4 L to transduce the CARD - 4 L 
NF-kB pathway induction stimulus. 
5 In an additional experiment, coexpression of CARD- 

4 and the CARD domain of RICK revealed that the CARD 
domain of RICK functions as a dominant negative mutant 
suggesting that RICK is a downstream mediator of CARD-4 
function . 

10 Example 10: Discovery of CARD-4 enhancement of caspase 
9 activity. 

In ten transfection experiments, 293T cells 
coexpressing a beta galactosidase-expressing plasmid 
(pCMV /3-gal from Stratagene) as a marker for viable cells 

15 and either pCI , pCI-CARD-3, pCI-APAF, pCI -CARD-4L , 

pCI - CARD-4 S , pCI - CARD4 LnoLRR , pCI - CARD4NBSonly , pCI - 
CARD4 LCARDonly , pC I - CARD - 4 LnoCARD or pCI-casp9 
(expressing caspase-9) were created. Transf ect ions 
included 4 00 ng of pCMV /S-gal, 800 ng of expression 

2 0 plasmid, and Superfect transfection reagent from Qiagen 
and were carried out according to the manufacturer's 
directions. After 40-48 hours, cells were fixed and 
stained for beta-galactosidase expression and cell 
viability was determined by counting the number of beta 

25 galactosidase positive cells. Expression of pCI , pCI- 
CARD-3, pCI-APAF, pCI-CARD-4L, pCI-CARD-4S, pCI- 
CARD4 LnoLRR , pCI -CARD4NBSonly , pC I -CARD4 LCARDonly , and 
pCI -CARD-4LnoCARD did not result in loss of cell 
viability. As expected, expression of pCI-casp9 in 293T 

30 cells resulted in a loss of viability of about 75% of the 
cells in the experiment . 

It was next tested whether pCI, pCI-CARD-3, 
pCI-APAF, pCI -CARD-4L, pCI-CARD-4S, pC I -CARD4 LnoLRR, pCI - 
CARD4NBSonly, pC I -CARD4 LCARDonly , or pCI -CARD-4LnoCARD 
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would regulate caspase 9-mediated apoptosis. In nine 
transfection experiments, 293T cells coexpressing a beta 
galactosidase-expressing plasmid as a marker for viable 
cells, pCI-casp9, and either pCI , pCI-CARD-3, pCI-APAF, 
5 pCI - CARD - 4 L , pCI -CARD-4S , pCI - CARD4 LnoLRR , pCI- 

CARD4NBSonly / pCI - CARD4LCARDonly , and p C I - CARD - 4 LnoCARD 
were created. After 40-48 hours, cells were fixed and 
stained for beta-galactosidase expression and cell 
viability was determined by counting the number of beta 
10 galactosidase positive cells. Expression of pCI, 

pCI- CARD -4 LnoCARD, and pCI -CARD4NBSonly in the caspase 
9-expressing 293T cells had no effect on the caspase 
9-induced apoptosis. However, pCI-CARD-3 , pCI-CARD-4L, 
pCI- CARD-4 S, pC I -CARD4 LnoLRR, pCI -CARD4LCARDonly and, as 
15 expected, pCI-APAF enhanced the level of caspase 

9-induced apoptosis to 20 or less beta galactosidase 
positive cells per experiment from about 100 beta 
glactosidase positive cells per experiment. 

This experiment demonstrated that CARD-4 can 
2 0 enhance caspase 9-mediated apoptosis because coexpression 
of CARD-4L or CARD - 4 S with caspase- 9 dramatically 
increases caspase-9 mediated apoptosis. Furthermore, the 
CARD-4 CARD domain (SEQ ID NO: 10) is necessary and 
sufficient for CARD- 4 -mediated enhancement of caspase-9- 
25 potentiated apoptosis because CARD-4L lacking its CARD 
domain (pCI-CARD-4LnoCARD) does not enhance caspase-9- 
mediated apoptosis while the CARD-4 CARD domain expressed 
alone (pCI-CARD4LCARDonly) does induce caspase-9 mediated 
apoptosis. In addition, the LRR present in CARD-4 is not 
30 required for CARD-4 enhancement of caspase-9-mediated 

apoptosis because expression of a CARD-4 protein lacking 
the LRR ( pC I -CARD4 LnoLRR) still enhances caspase- 9- 
mediated apoptosis. The CARD-4 NBS is not sufficient for 
CARD-4 enhancement of caspase- 9 -mediated apoptosis 
35 because expression of the CARD-4 NBS only (pCI- 
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CARD4NBSonly ) does not enhance caspase-9 mediated 
apoptosis. This experiment also demonstrates that CARD- 3 
can enhance caspase- 9 -mediated apoptosis. 

As detailed below in Example 12, CARD-4 does not 
appear to interact directly with caspase-9, suggesting 
that potentiation of caspase-9 activity by CARD-4 is 
mediated by activation of downstream pathways. 

Example 11: Identification and tissue distribution of 
mRNA species expressed by the human. CARD-4 gene. 

Northern analysis of mRNAs extracted from adult 
human tissues revealed a 4.6 kilobase mRNA band that was 
expressed in most tissues examined. Highest expression 
was observed heart, spleen, placenta and lung. CARD-4 was 
also observed to be expressed in fetal brain, lung, liver 
and kidney. Cancer cell lines expressing the 4.6 kilobase 
CARD-4 mRNA include HeLa, K562, Molt4, SW480, A549 and 
melanoma. A larger 6.5 to 7.0 kilobase CARD-4 mRNA was 
expressed in heart, spleen, lung, fetal lung, fetal 
liver, and in the Molt4 and SW480 cell lines. 

Example 12: Physical association of CARD-4 with CARD-3. 

CARD-4 -specif ic PCR primers with the 3' primer 
encoding the HA epitope tag were used to amplify the 
CARD- 4 L gene epitope tagged with HA and this PCR product 
was cloned into the mammalian expression vector pCI . 
CARD-3 -specif ic PCR primers with the 5' primer encoding 
the MYC epitope tag were used to amplify the CARD- 3 gene 
epitope tagged with MYC and this PCR product was cloned 
into the mammalian expression vector pCI . CARD-3-specif ic 
PCR primers with the 5' primer encoding the MYC epitope 
tag were used to amplify the CARD-3 gene lacking the CARD 
domain (SEQ ID NO: 6) epitope tagged with MYC and this PCR 
product was cloned into the mammalian expression vector 
pCI . Caspase 9-specific PCR primers with the 3 7 primer 
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encoding the MYC epitope tag were used to amplify the 
caspase 9 gene epitope tagged with MYC and this PCR 
product was cloned into the mammalian expression vector 
pCI . In three transfection experiments, 293T cells 
5 coexpressing pCI -CARD-4LcHA and either pCI-CARD3nMYC, 
pCI-CARD3noCARDnMYC, or pCI -casp9cMYC were created. 
Cells from each transfected line were lysed and an 
immunoprecipitation procedure was carried out on each 
lysate with an anti-MYC epitope tag antibody to 

10 precipitate the CARD-4LcHA expressed by each cell line 
and any physically associated proteins. 
Immunoprecipitated proteins were separated by 
electrophoresis on denaturing polyacryl amide gels, 
transferred to nylon filters, and probed with an ant i -HA 

15 epitope tag antibody in a Western blotting experiment to 
determine whether the MYC-tagged protein that was 
coexpressed with the CARD-4LcHA protein had 
coimmunoprecipitated with the CARD-4LcHA protein. In 
this experiment, CARD-3 was found to coimmunoprecipitate 

20 with CARD -4 while CARD-3 lacking its CARD domain and 

caspase-9 did not coimmunoprecipitate with CARD- 4 . This 
experiment demonstrates that CARD-4 and CARD-3 physically 
associate and that CARD-3 requires its CARD domain to 
associate with CARD-4. In addition, CARD-4 appears to 

25 not associate with caspase-9. 

Example 13: CARD-4 Genomic Sequence 

Figure 18 is depicts the 32042 nucleotide genomic 
sequence of CARD-4. This sequence is based the CARD-4 
cDNA sequence described above and a BAC sequence (DBEST 
30 Accession No. AC006027) . The CARD-4 cDNA sequence 

described above was used to correct three errors in the 
BAC sequence, including one error resulting in a 
frameshift. The CARD-4 genomic sequence of Figure 18 
includes the following introns and exons: exon 1: 
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nucleotides 364-685, encoding amino acids 1-67 (start 
codon at nucleotides 485-487)/ intron 1: nucleotides 
686-2094; exon 2: nucleotides 2095-2269, encoding amino 
acids 67-126; intron 2: nucleotides 2270-4365; exon 3: 
5 nucleotides 366-6190, encoding amino acids 126-734; 
intron 3: nucleotides 6191-9024; exon 4: nucleotides 
9025-9108, encoding amino acids 734-762; intron 4: 
nucleotides 9109-10355; exon 5: nucleotides 10356-10439, 
encoding amino acids 762-790; intron 5: nucleotides 

10 10440-11181; exon 6: nucleotides 1182-11265, encoding 
amino acids 790-818; intron 6: nucleotides 11266-19749; 
exon 7: nucleotides 19750-19833, encoding amino acids 
818-846; intron 7: nucleotides 19834-21324; exon 8: 
nucleotides 21325-21408, encoding amino acids 846-874; 

15 intron 8: nucleotides 21409-24226; exon 9: nucleotides 
24227-24310, amino acids 874-903; intron 9: nucleotides 
24311-27948; exon 10: nucleotides 27949-28032, amino 
acids 903-930; intron 10: nucleotides 28033-31695; exon 
11: nucleotides 31696-32024, encoding amino acids 930-953 

20 (stop codon at nucleotides 31766-31768) . 

The introns in the CARD-4 genomic sequence contain 
consensus splice donor and acceptor sites (Molecular Cell 
Biology, Darnell et al . , eds.,1996). The CARD-4 genomic 
sequence is usful for genetic identification and mapping 

25 and identifying mutations, e.g., mutations is splice 
donor or splice acceptor sites. 

Equivalents 

Those skilled in the art will recognize, or be 
able to ascertain using no more than routine 
3 0 experimentation, many equivalents to the specific 

embodiments of the invention described herein. Such 
equivalents are intended to be encompassed by the 
following claims. 
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What is claimed is : 

1. An isolated nucleic acid molecule selected 
from the group consisting of: 

a) a nucleic acid comprising the nucleotide 
5 sequence of SEQ ID NO: 38 or a complement thereof; 

b) a nucleic acid comprising the nucleotide 
sequence of SEQ ID NO: 40 or a complement thereof; 

c) a nucleic acid comprising the nucleotide 
sequence of SEQ ID NO: 42, or a complement thereof; 

10 d) a nucleic acid molecule which encodes a 

polypeptide comprising the amino acid sequence of SEQ ID 
NO: 38; 

e) a nucleic acid molecule which encodes a 
polypeptide comprising the amino acid sequence of SEQ ID 

15 NO:40; 

f) a nucleic acid molecule which encodes a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO: 42 ; 

g) a nucleic acid comprising the nucleotide 
2 0 sequence of SEQ ID NO: 38 or a complement thereof; 

h) a nucleic acid consisting of the nucleotide 
sequence of SEQ ID NO: 40 or a complement thereof; 

i) a nucleic acid consisting of the nucleotide 
sequence of SEQ ID NO: 42, or a complement thereof; 

25 j) a nucleic acid molecule which encodes a 

polypeptide consisting of the amino acid sequence of SEQ 
ID NO: 38; 

k) a nucleic acid molecule which encodes a 
polypeptide consisting of the amino acid sequence of SEQ 
30 ID NO: 40; and 

1) a nucleic acid molecule which encodes a 
polypeptide consisting of the amino acid sequence of SEQ 
ID NO: 42. 
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2 . A host cell which contains one of the nucleic 
acid molecules of claim 1. 



3 . An isolated polypeptide selected from the 
group consisting of: 
5 a) a polypeptide comprising the amino acid 

sequence of SEQ ID NO: 39; 

b) a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 41; 

c) a polypeptide comprising the amino acid 
10 sequence of SEQ ID NO: 43; 

d) a polypeptide consisting of the amino acid 
sequence of SEQ ID NO: 3 9; 

e) a polypeptide consisting of the amino acid 
sequence of SEQ ID NO: 41; 

15 f) a polypeptide consisting of the amino acid 

sequence of SEQ ID NO: 43; 

g) a fragment of a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 39, wherein the fragment 
comprises at least 15 contiguous amino acids of SEQ ID 

20 NO:39; 

h) a fragment of a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 41, wherein the fragment 
comprises at least 15 contiguous amino acids of SEQ ID 
NO: 41; 

25 i) a fragment of a polypeptide comprising the 

amino acid sequence of SEQ ID NO: 43, wherein the fragment 
comprises at least 15 contiguous amino acids of SEQ ID 
NO: 43 ; 

j) a naturally occurring allelic variant of a 
30 polypeptide consisting of the amino acid sequence of SEQ 
ID NO: 39, wherein the polypeptide is encoded by a nucleic 
acid molecule which hybridizes to a nucleic acid molecule 
consisting of SEQ ID NO: 38 under stringent conditions; 
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k) a naturally occurring allelic variant of a 
polypeptide consisting of the amino acid sequence of SEQ 
ID NO: 41, wherein the polypeptide is encoded by a nucleic 
acid molecule which hybridizes to a nucleic acid molecule 
5 consisting of SEQ ID NO: 40 under stringent conditions; 
and 

1) a naturally occurring allelic variant of a 
polypeptide consisting of the amino acid sequence of SEQ 
ID NO: 43, wherein the polypeptide is encoded by a nucleic 
10 acid molecule which hybridizes to a nucleic acid molecule 
consisting of SEQ ID NO: 42 under stringent conditions. 

4 . An antibody which selectively binds to any 
one of the polypeptides of claim 3 . 

5. A method for producing a polypeptide selected 
15 from the group consisting of: 

a) a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 39; 

b) a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 41; 

20 c) a polypeptide comprising the amino acid 

sequence of SEQ ID NO: 43; 

d) a polypeptide consisting of the amino acid 
sequence of SEQ ID NO: 39; 

e) a polypeptide consisting of the amino acid 
25 sequence of SEQ ID NO: 41; 

f ) a polypeptide consisting of the amino acid 
sequence of SEQ ID NO:43; 

g) a fragment of a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 39, wherein the fragment 

30 comprises at least 15 contiguous amino acids of SEQ ID 
NO : 3 9 ; 

h) a fragment of a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 41, wherein the fragment 
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comprises at least 15 contiguous amino acids of SEQ ID 
NO : 4 1 ; 

i) a fragment of a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 43, wherein the fragment 
5 comprises at least 15 contiguous amino acids of SEQ ID 
NO: 43 ; 

j) a naturally occurring allelic variant of a 
polypeptide consisting of the amino acid sequence of SEQ 
ID NO: 39, wherein the polypeptide is encoded by a nucleic 

10 acid molecule which hybridizes to a nucleic acid molecule 
consisting of SEQ ID NO: 38 under stringent conditions; 

k) a naturally occurring allelic variant of a 
polypeptide consisting of the amino acid sequence of SEQ 
ID NO: 41, wherein the polypeptide is encoded by a nucleic 

15 acid molecule which hybridizes to a nucleic acid molecule 
consisting of SEQ ID NO: 40 under stringent conditions; 
and 

1) a naturally occurring allelic variant of a 
polypeptide consisting of the amino acid sequence of SEQ 

2 0 ID NO: 43, wherein the polypeptide is encoded by a nucleic 

acid molecule which hybridizes to a nucleic acid molecule 
consisting of SEQ ID NO: 42 under stringent conditions; 

comprising the step of culturing the host cell of 
claim 2 under conditions in which the nucleic acid 
25 molecule is expressed. 

6. A method for detecting the presence of a 
polypeptide of claim 2 in a sample, comprising: 

a) contacting the sample with a compound which 
selectively binds to a polypeptide of claim 2; and 

3 0 b) determining whether the compound binds to the 

polypeptide of claim 2 in the sample. 
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7. A method for detecting the presence of a 
nucleic acid molecule of claim 1 in a sample, comprising 
the steps of : 

a) contacting the sample with a nucleic acid probe 
or primer which selectively hybridizes to the nucleic 
acid molecule; and 

b) determining whether the nucleic acid probe or 
primer binds to a nucleic acid molecule in the sample. 

8. A method for identifying a compound which 
binds to a polypeptide of claim 2 comprising the steps 
of: 

a) contacting a polypeptide, or a cell 
expressing a polypeptide of claim 2 with a test compound; 
and 

b) determining whether the polypeptide binds to 
the test compound. 

9. The method of claim 8, wherein the binding of 
the test compound to the polypeptide is detected by a 
method selected from the group consisting of: 

a) detection of binding by direct detecting of 
test compound/polypeptide binding; 

b) detection of binding using a competition 
binding assay; and 

c) detection of binding using an assay for 
CARD-4L or CARD-4S mediated signal transduction. 

10. A method for modulating the activity of a 
polypeptide of claim 2 comprising contacting a 
polypeptide or a cell expressing a polypeptide of claim 2 
with a compound which binds to the polypeptide in a 

0 sufficient concentration to modulate the activity of the 
polypeptide . 
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11. A method for identifying a compound which 
modulates the activity of a polypeptide of claim 2, 
comprising : 

a) contacting a polypeptide of claim 2 with a 
5 test compound; and 

b) determining the effect of the test compound 
on the activity of the polypeptide to thereby identify a 
compound which modulates the activity of the polypeptide. 

12. A method for identifying a compound that 
blocks the interaction between a CARD-4 protein 
comprising a CARD-4 domain and a CARD-4 - interact ing 
protein comprising the steps of: 

a) incubating said CARD-4 protein and said 
interactor in the presence and absence of a test agent; 
b) determining whether said test agent reduces the 

binding of said CARD-4 protein and said interactor; and 

c) identifying a compound that blocks the 
interaction of said CARD-4 protein with said interactor 
when said compound reduces the binding of said CARD-4 
protein with said interactor; 

wherein said interactor is selected from the group 
consisting of CARD- 3 and hNUDC and wherein said CARD-4 
domain comprises amino acids 1-145 of an amino acid 
sequence selected from the group consisting of SEQ ID 
NO: 8 and SEQ ID NO: 43. 

13. The method of claim 12, wherein the CARD-4 
protein comprising a CARD-4 domain is selected from the 
group consisting of: 

a) a polypeptide comprising the amino acid 
3 0 sequence of SEQ ID NO: 8; 

b) a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 39; 



15 



20 
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c) a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 41; and 

d) a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 43. 



5 14. The method of claim 12, wherein the CARD-4 

protein and interactor are expressed in a recombinant 
prokaryotic or eukaryotic cell line or wherein the CARD-4 
protein and interactor are isolated proteins or present 
in cell -free protein extracts. 

0 15. A method for identifying a compound that 

inhibits the induction of the NF-kB pathway by a CARD-4 
protein comprising the steps of: 

a) incubating a recombinant cell line containing 
a vector expressing CARD-4 in the presence and absence of 

5 a test agent ; 

b) determining whether said test agent inhibits 
the induction of the NF-/cB pathway by CARD-4; and 

c) identifying a compound that inhibits the 
induction of the NF-kB pathway by CARD-4. 

0 16. The method of claim 15, wherein the CARD-4 

protein comprising a CARD-4 domain is selected from the 

group consisting of: 

a) a polypeptide comprising the amino acid 

sequence of SEQ ID NO: 8; 
5 b) a polypeptide comprising the amino acid 

sequence of SEQ ID NO: 39; 

c) a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 41; and 

d) a polypeptide comprising the amino acid 
0 sequence of SEQ ID NO: 43. 
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17. The method of claim 15 further comprising: 
a) incubating a recombinant cell line expressing 
CARD- 4 and also expressing an NF-kB pathway reporter gene 
in the presence and absence of a test agent; 
5 b) determining whether said test agent inhibits 

the induction of the NF-eB pathway reporter gene by 
CARD- 4; and 

c) identifying a compound that inhibits the 
induction of the NF-kB pathway reporter gene by CARD- 4 . 

10 18. A method for identifying a compound that 

inhibits the enhancement of caspase 9 activity by a 
CARD- 4 protein comprising a CARD-4 domain comprising the 
steps of : 

a) incubating a recombinant cell line expressing 
15 caspase 9 and CARD-4 in the presence and absence of a 

test agent ; 

b) determining whether said test agent inhibits 
caspase 9 activity; and 

c) identifying a compound that inhibits the 
20 enhancement of caspase 9 activity by a CARD-4 protein. 

19. The method of claim 18 further comprising: 

a) incubating a recombinant cell line expressing 

caspase 9 and CARD-4 and a beta-galactosidase expression 

vector in the presence and absence of a test agent ; 
25 b) determining whether the presence of said test 

agent increases the proportion of cells that stain 

positive for beta-galactosidase; and 

c) identifying a compound that inhibits the 

enhancement of caspase 9 activity by a CARD-4 protein by 
3 0 identifying a compound that increases the proportion of 

cells that stain positive for beta-galactosidase. 
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20. The method of claim 18, wherein the CARD-4 
protein comprising a CARD-4 domain is selected from the 
group consisting of: 

a) a polypeptide comprising the amino acid 
5 sequence of SEQ ID NO: 8; 

b) a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 39; 

c) a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 41; and 

10 d) a polypeptide comprising the amino acid 

sequence of SEQ ID NO:43. 

21. A method for identifying a compound that 
inhibits the enhancement of caspase 9 activity by a 
CARD- 3 protein comprising a CARD- 3 domain comprising the 

15 steps of: 

a) incubating a recombinant cell line expressing 
caspase 9 and CARD- 3 in the presence and absence of a 
test agent ; 

b) determining whether said test agent inhibits 
2 0 caspase 9 activity; and 

c) identifying a compound that inhibits the 
enhancement of caspase 9 activity by a CARD- 3 protein. 

22. The method of claim 21 further comprising: 

a) incubating a recombinant cell line expressing 
25 caspase 9 and CARD-3 and a beta-galactosidase expression 

vector in the presence and absence of a test agent; 

b) determining whether the presence of said test 
agent increases the proportion of cells that stain 
positive for beta-galactosidase; and 
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c) identifying a compound that inhibits the 
enhancement of caspase 9 activity by a CARD- 3 protein by 
identifying a compound that increases the proportion of 
cells that stain positive for beta-galactosidase . 
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ZCACGCG7CCSG7CAGC7C7GG7TC3GAGAAGCAGCGGCT3GCG7GGGCC^.TCZGGGGAA7GGGC 

GCCGTCGTGACCTAGTGTTGCGGGGCAAAAAGGGTCTTGCCGGCCTCGCTCGTGCAGGGGCGTAT 

rTGGGCGCCTGAGCGCGGCGTGGGAGCCTTGGGAGCCGCCGCAGCAGGGGGGACACCCGGAACCG 

GCCT3AGCGCCCGGGACCATGAACGGGGAGGCCATCTGCAGCGCCCTGCCCACCATTCZCTACCA 

rAAACTCGCCGACCTGCGCTACCTGAGCCGCGGCGCCTCTGGCACTGTGTCGTCCGCCZGCCACG 

CAGACT3GCGCGTCCAGGTGGCCGTGAAGCACCTGCACATCCACACTCCGC73CTCGACAGTGAA 

AGAAAGGA7GTCTTAAGAGAAGC73AAAT77TACACAAAGCTAGATTTAG77ACATTCT7CCAA7 

r77GGGAA77TGCAATGAGCC7GAATTT77GGGAATAGT7AC7GAATACA7GCCAAA7GGAXCA7 

7AAA7GAACTCC7ACA7AGGAAAAC73AA7ATCCTGATG7TGC77GGCCA77GAGA777C3 7A7C 

C7GCATGAAATTGCCCTTGGXGTAAATTACCTGCACAATATGACTCCTCCT7TAC77CA7CATGA 

C7TGAAGAC7CAGAATATC77A77GGACAA7GAA777CAXG7TAAGAT7GCAGA7T7TGG777A7 

CAAAG7GGCGCATGATG7CCC7C7CACAG7CACGAAG7AGCAAA7CTGCACCAGAAGGAGGGACA 

A7TATCTA7ATGCCACC7GAAAAC7ATGAACC7GGACAAAAAXCAAGGGCCAGTA7CAAGCACGA 

^A7A7A7AGCTATGCAGTTA7CACA7GGGAAG7G7TATCCAGAAAACAGCC77TTGAAGA7G7CA 

CCAA7CC777GCAGA7AA7G7A7AG7G7G7CACAAGGACA7CGACC7G77A77AA7GAAGAAAG7 

TTGCCA7A7GA7A7ACC7CACCGAGCACG7A7GA7C7C7C7AA7AGAAAG7GGA7GGGCACAAAA 

7CCAGA7GAAAGACCA7CT7TC77AAAATGTTTAATAGAACT7GAACCAG7TTTGAGAACATTTG 

AAGAGA7AAC7777C77GAAGC7G77A77CAGC7AAAGAAAACAAAG77ACAGAG7G777CAAG7 

Z C CA77CAC C7AXG7GACAAGAAGAAAA7GGAA77A7C7373AAC A7AC C737AAA7CA7GG7C C 

ACAAGAGGAATCATGTGGATCCTCTCAGCTCCATGAAAATAGTGGTTC7CCTGAAAC7TCAAGGT 

CZZ7GCCAGC7CZ7ZAAGACAA7GA777777A7C7AGAAAAGC7ZAAGAC7G77A7777A7GAAG 

ZTGCA7C^C7G7CC7GGAAA7CACAG77GGGA7AGCACC\777C7GGA7C7CAAAGGGC7GCA77 

27G7GA7CACAAGACCA77CCA7GC7C77CAGCAA7AA7AAA7CCACTCTCAACTGCAGGAAACT 

CAGAACG7CTGCAGCCTGG7A7AGCCCAGCAGTGGATCCAGAGCAAAAGGGAAGACA7TG7GAAC 

CAAA7GACAGAAGCC7GCC77AACCAG7CGC7AGATGCCCTTC7G7CCAGGGAC77GA7CA7GAA 

AGAGGAC7A7GAAC77G77AG7ACCAAGCC7ACAAGGACC7CAAAAG7CAGACAA77AC7AGACA 

C7AC7GACA7CCAAGGAGAAGAAT77GCCAAAG77A7AG7ACAAAAA77GAAAGA7AACAAACAA 

ATGGG7C77CAGCC7TACCCGGAAA7AC77GTGG7TTCTAGA7CACCATCTT7AAA777AC77CA 

AAA7AAAAGCA7G7AAG7GAC7377777CAAGAAGAAA7G7G777CA7AAAAGGA7A777A7AAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA;^^ ( S£ Q ID NO:l) 



FIG. 1 
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His 


Asp 


Leu 


_ V s 


Thr 


G 1 n 


Asn 


He 


Leu 


Leu 


Aso 


Asn 


Glu 


Phe 


H L. s 


V a 1 


160 


L v S 






As D 


Phe 




Leu 


Ser 


_ • 3 


Tr-o 


Arg 


Met 


Met 


Ser 


Leu 


Ser 


G 1 n 


Ser 


Aro 


Ser 


130 




* "3 


Ser 


Ala 


Fro 


Glu 


Gly 


Gly 


Thr 


2 le 


lie 


Tvr 


Met 


Pro 


Pro 


Glu 


Asn 


Tvr 


Gil 


Pro 


200 


Glv 


Gin 


Lvs 


Ser 


Arg 


Ala 


Ser 


lie 


Lys 


His 


ASD 


lie 


Tvr 


Ser 


Tyr 


Ala 


Val 


lie 


Tr - 


Trp 


220 


Glu 


Val 


Leu 


Ser 


Arg 


Lys 


Gin 


Pre 


Phe 


Glu 


Asp 


Val 


Thr 


Asn 


Pro 


Leu 


Gin 


He 


Me t 


Tvr 


240 


Ser 


Val 


Ser 


Gin Gly 


His 


Arc 




Val 




Asn 


Glu 


Glu 


Ser 


Leu 


Pro 


Tvr 


Asd 




Pro 


260 


H:s 


Arg 


Ala 


Arg 


Met 




Ser 


Leu 




Glu 


Ser 


Gly 


Tro 


Ala 


Gin 


Asn 


Pro 


As D 




Ara 


280 




Ser 


?he 


Leu 


Lys 


Cvs 


Leu 




Glu 


Leu 


Glu 


Pro 


Val 


Leu 


Arg 




Phe 


Glu 




lie 


300 




Phe 


Leu 


Glu 


Ala 


Val 




Gin 


Leu 


Lys 


Lys 


T>U V 


Lys 


Leu 


Gin 


Ser 


Val 






Ala 


320 




HIS 


Leu 


Cys 


ASp 


Lys 


Lvs 


Lys 


Met 


Glu 


Leu 


Ser 


Leu 


Asn 


He 


P ro 


Val 


Asn 


His 


Gly 


340 


? ro 


•31a 


Glu 


Glu 


Ser 


Cys 


Gly 


Ser 


Ser 




Leu 


His 


Glu 


Asn 


Ser 


Gly 


Ser 








360 


Ser 


Arg 


Ser 


Leu 


Pro 


Ala 


Pro 


Gin 


Asp 


Asn 


Asp 


Phe 


Leu 


Ser 


Arg 


Lys 


Ala 


Gin 


Asp 


Cys 


380 




Phe 


Met 


Lys 


Leu 


His 


His 


Cys 


Pro 




Asn 


His 


Ser 


Trp 


Asp 


Ser 








Gly 


400 


ier 


3 In 


Arg 


Ala 


Ala 


Phe 


CV3 


Asp 


His 


Lys 






Pro 


Cys 


Ser 




Ala 






Asn 


420 




leu 


Ser 




Ala 


Gly 


Asn 


Ser 


Glu 


Arg 


Leu 


Gin 




G 1 y 


He 


Ala 


Gin 


G 1 n 




1 le 


440 






Lys 


Arg 


Glu 


Asp 




val 


Asn 


G In 


A T* 






Ala 


Cys 


Leu 


Asn 




Ser 


Leu 


460 


Asp 




Leu 


Leu 


Ser 


Arg 


ASD 


Leu 




Met 


— V 2 


Glu 


Asp 


Tyr 


Glu 


Leu 


Val 






Lys 


480 


? ro 




Arg 




Ser 


Lvs 


Val 


.Arg 


Gin 


Leu 




Asp 






Asp 


lie 


Gin 


Glv 


Glu 


Glu 


SOO 


Phe 


Ala 


Lvs 


Val 


He 


Val 


Gin 


Lys 


Leu 


Lys 


Asp 


Asn 




Gin 


Met 


Glv 


Leu 


Gin 


r- ro 


Tyr 


520 


Pre 


Glu 


lie 


Leu 


Val 


Val 




Arg 


Ser 


Pro 


Ser 


Leu 


Asn 


Leu 


Leu 


Gin 


Asn 


— y s 


Ser 


Met 


540 
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TCCTCCCTTC CTCCCGTTCC AGTGCCTGCA GGGCAGTGGT CCGGCGCGGG AAGACCTCTT 
IAAGAACAAG GATCACTTCC AGTTCACCAA CCTCTTCCTG TGCGGGCTGT TGTCCAAAGC 
CAAACAGAAA CTCCTGCGGC ATCTGG7GCC CGCGGCAGCC CTGAGGAGAA AGCGCAAGGC 
GCTGTGGGCA CACCTGTTTT CCAGCCTGCG GGGCTACCTG AAGAGCCTGC CCCGCGTTCA 
3GTCGAAAGC 7TCAACCAGG TGCAGGCCAT GCCCACGTTC ATCTGGATGC TGCGCTGCAT 
CTACGAGACA CAGAGCCAGA AGGTGGGGCA GCTGGCGGCC AGGGGCATCT GCGCCAACTA 
CCTCAAGCTG ACCTACTGCA ACGCCTGCTC GGCCGACTGC AGCGCCCTCT CCTTCGTCCT 
GCATCACTTC CCCAAGCGGC TGGCCCTAGA CCTAGACAAC AACAATCTCA AC GACTACGG 
CGTGCGGGAG CTGCAGCCCT GCTTCAGCCG CCTCACTGTT CTCAGACTCA GCGTAAACCA 
GAT C ACTGAC GGTGGGGTAA AGGTG CTAAG CGAAGAGCTG ACCAAATACA AAATTGTGAC 
CTATTTGGGT TTATACAACA ACCAGATCAC CGATGTCGGA GCCAGGTACG TCACCAAAAT 
Z CTG GATG AA TGCAAAGGCC TCACGCATCT TAAACTGGGA AAAAACAAAA TAACAAGTGA 
AGGAGGGAAG TATCTCGCCC TGGCTGTGAA GAACAGCAAA TCAATCTCTG AGGTTGGGAT 
GTGGGGCAAT CAAGTTGGGG ATGAAGGAGC AAAAGCCTTC GCAGAGGCTC TGCGGAACCA 
CCCCAGCTTG ACCACCCTGA GTCTTGCGTC CAACGGCATC TCCACAGAAG GAGGAAAGAG 
CCTTGCGAGG GCCCTGCAGC AG AACAC GTC TCTAGAAATA CTGTGGCTGA CCCAAAATGA 
ACTCAACGAT GAAGTGGCAG AGAGTTTGGC AGAAATGTTG AAAGTCAACC AGACGTTAAA 
GCATTTATGG CTTATCCAGA AT CAGATCAC AGCTAAGGGG ACTGCCCAGC TGGCAGATGC 
GTTACAGAGC AACACTGGCA TAAGAGAGAT TTGC CTAAAT GGAAACCTGA TAAAACCAGA 
GGAGGC CAAA GTCTATGAAG ATGAGAAGCG GATTATCTGT TTCTGAGAGG ATGCTTTCCT 
GTTCATGGGG TTTTTGCCCT GGAGCCTCAG C AGCAAATG C CACT CTGGGC AGTCTTTTGT 
GTCAGTGTCT TAAAGGGGCC TGCGCAGGCG GGACTATCAG GAGTCCACTG CCTYCATGAT 
GCAAGCCAGC TTCCTGTGCA GAAGGTCTGG TCGGCAAACT CCCTAAGTAC CCGCTACAAT 
T CT G C AG AAA AAGAATGTGT CTTGCGAGCT GTTGTAGTTA CAGTAAATAC ACTGTGAAGA 
GAAAAAAAAA ACGGACGCGT GG (SEQ ID NOs7) 
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7CCTCCCTTC CTCCCGTTCC AGTGCCTGCA GGGCAG7GGT CCGGCGCGGG AAGACCTCTT 
7AAGAACAAG GATCACTTCC AGTTCACCAA CCTCTTCCTG TGCGGGCTGT TGTCCAAAGC 
7AAACAGAAA CTCCTGCGGC ^TCTZGTOCC CGCGGCAGCC CTGAGGAGAA AGCGCAAGGC 
CCTGTGGGCA CACCTGTTTT CCAGCCTGCG GGGGTACCTG AAGAGCCTGC CCCGCGTTCA 
3GTCGAAAGC TTCAACCAGG 7 G C AG G C CAT GCCCACGTTC ATCTGGATGC 7GCGCTGCAT 
CTACGAGACA CAGAGCCAGA AGGTGGGGCA GCTGGCGGCC AGGGGCATCT GCGCCAACTA 
CGTCAAGCTG ACCTACTGCA ACGCCTGCTC GGCCGACTGC AGCGCCCTCT CCTTCGTCCT 
GCATCACTTC CCCAAGCGGC TGGCCCTAGA CCTAGACAAC AACAATCTCA ACGACTACGG 
CGTGCGGGAG CTGCAGCCCT GCTTCAGCCG CCTCACTGTT CTCAGACTCA GCGTAAACCA 
GATCACTGAC GGTGGGGTAA AGGTG CTAAG CGAAGAGCTG ACCAAATACA AAATTGTGAC 
CTATTTGGGT TTATACAACA ACCAGATCAC CGATGTCGGA GCCAGGTACG TCACCAAAAT 
7CTGGATGAA TGCAAAGGCC T C AC G CAT CT 7AAACTGGGA AAAAACAAAA TAACAAGTGA 
AGGAGGGAAG TATCTCGCCC TGGCTGTGAA GAACAGCAAA TCAATCTCTG AGGTTGGGAT 
GTGGGG CAAT CAAGTTGGGG ATGAAGGAGC AAAAGCCTTC GCAGAGGCTC TGCGGAACCA 
CCCCAGCTTG ACCACCCTGA GTCTTGCGTC CAACGGCATC TCCACAGAAG GAGGAAAGAG 
CCTTGCGAGG GCCCTGCAGC AGAACACGTC TCTAGAAATA CTGTGGCTGA CCCAAAATGA 
ACTCAACGAT GAAGTGGCAG AGAGTTTGGC AGAAATGTTG AAAGTCAACC AGACGTTAAA 
GCATTTATGG CTTATCCAGA AT CAGAT CAC AG CT AAGGGG ACTGCCCAGC TGGCAGATGC 
GTTACAGAGC AACACTGGCA TAACAGAGAT TTGCCTAAAT GGAAACCTGA TAAAACCAGA 
GGAGGC CAAA GTCTATGAAG ATGAGAAGCG GATTATCTGT TTCTGAGAGG ATGCTTTCCT 
GTTCATGGGG TTTTTGCCCT GG AG C CT CAG CAGCAAATGC CACTCTGGGC AGTCTTTTGT 
GTCAGTGTCT TAAAGGGGCC TGCGCAGGCG GGACTATCAG GAGTCCACTG C CTY CAT GAT 
GCAAGCCAGC TTCCTGTGCA GAAGGTCTGG TCGGCAAACT CCCTAAGTAC CCGCTACAAT 
TCTGCAGAAA AAGAATGTGT CTTGCGAGCT GTTGTAGTTA CAGTAAATAC ACTGTGAAGA 
GAAAAAAAAA ACGGACGCGT GG (SEQ ID NO: 7) 
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MEEQGKSEMEIIPSESKPKIQLLKSNRELLVTKIRNT^ 

DKVRKILDLVQSKGEEVSEFF-LYLLCQ 

^QLRHKLGRDSKFVLCYAQKEELLLEEIY!^ 

ETIFILGBAGVGKSMLLCRLQSLWATGRLBAGVKFFFHFRCRMFSCFKESDRLCLQDLLFKHYCY 
PERE-FEEVFAFLLRFPHVALFTFDGLDELKSDLDLSRVPDSSCPWEPAHPLVLLANLLSGKLLKG 
ASKLLTARTGIEVPRQFLRKiCTLLRGFSPSHLRAYARRMFPERALQDRLLSQLEANPNLCSLCSV 
?LFCWIIFRCFQHFRAAFEGS?QLPDCT!^TLTDVFLLVTEVHLNRMQPSSLVQRNTRSPVETLHA 
GRDTLCSLGQVAHRGMEKSLFVFTQEEVQASGLQERDMQLGFLRALPELGPGGDQQSYEFFHLTL 
QAFFTAFFLVLDDRVGTC^LLRFFQEWMPPAGAATTSCYFPFLPFQCLQGSGPAREDLFKNKDHF 
Q FTNLFLC GLLS KAKQ KLLRHLVP AAALRRKRKALWAHLF S S LRGYLKS LPRVQVE S FNQVQAMP 
TFIVmLRCIYETQSQK^GQIAARGICANYLKLTYCNACSADCSALSFVL 

bTO YGVRELQ PCFSRLTVLRLS VNQ I TDGGVKVLSEELTKYKI VTYLGLYNNQ I TDVGARYVTKI L 
DECKGL.TKLKLGKNKI TSEGGKYLALAVKNS KS I S EVGMWGNQVGDEGAKAFAEALRNHP S LTTL 
S LiASNG I S TEGGKS LARALQQNT S LE I LWLTCNELNDEVAES LAEMLKVNQTLKHLWL I QNQ I TA 
KGTAQ LAD ALQ SNTG I TE I CLNGNL I KPEEAKVYEDEKR 1 1 CF (SEQ ID NO:8) 
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IACGCG A CGGAGT7GC7GAAGAA7GAC7AC77C7CGGCCGAAGA7GCGGAGA77G7G7 

GTGCCTGCCCCACCCAGCC7GACAAGGTCCGCAAAATTCTGGACCTGGTACAGAGCAAG 

GGCGAGGAGGTGTCCGAGTTCTTCCTCTACTTGCTCCAGCAACTCGCAGATGCCTACG^ 

GGACCTCAGGCCTTGGCTGCTGGAGATCGGCTTCTCCCCTTCCCTGCTCACTCAGAGCA 

.\AGTCGTGGTCAACACTGACCCAGTGAGCAGGTATACCCAGCAGCTGCGACACCATCTG 

GGCCG7GAC7CCAAG77CG7GC7G7GCTATGCCCAGAAGGAGGAGC7GCTGC7GGAGGA 

GA7C7ACA7GGACACCA7CA7GGAGC7GG77GGC77CAGCAA7GAGAGCC7GGGCAGCC 

7GAACAGCC7GGCC7GCC7CC7GGACCACACCACCGGCA7CC7CAA7GAGCAGGG7GAG 

ACCATCTTCATCCTGGGTGA7GCTGGGGTGGGCAAGTCCATGCTGCTACAGCGGCTGCA 

GAGCCTCTGGGCCACGGGCCGGCTAGACGCAGGGGTCAAATTCTTCTTCCACTTTCGCT 

GCCGCA7G77CAGC7GC77CAAGGAAAG7GACAGGC7G7G7CTGCAGGACC7GCTC77C 

AAGCAC7AC7GC7ACCCAGAGCGGGACCCCGAGGAGG7G777GCC77CCTGC7GCGC77 

CCCCCACG7GGCCC7C77CACC7TCGA7GGGC7GGACGAGC7GCAC7CGGACT7GGACC 

7GAGCCGCG7GCCTGACAGC7CCTGCCCCTGGGAGCCTGCCCACCCCCTGGTCTTGCTG 

GCCAACCTGCTCAGTGGGAAGCTGCTCAAGGGGGCTAGCAAGCTGCTCACAGCCCGCAC 

AGGGA7GGAGG7CCCGCGCCAG77CC7GGGGAAGAAGG7GC77C7CCGGGGC77G7CGC 

CCAGCCACCTGCGCGCCTATGCCAGGAGGATGTTCCCCGAGCGGGCCCTGCAGGACCGC 

C7GCTGAGCCAGCTGGAGGCCAACCCCAACCTCTGCAGCCTGTGCTCTGTGCCCCTCTT 

CTGCTGGATCATCTTCCGG7GCTTCCAGCACTTCCG7GCTGCCTT7GAAGGCTCACCAC 

AGCTGCCCGACTGCACGATGACCCTGACAGATGTCTTCCTCCTGGTCACTGAGGTCCAT 

C7GAACAGGA7GCAGCCCAGCAGCCTGG7GGAGCGGAACACACGCAGCCCAG7GGAGAC 

CCTCCACGCCGGCCGGGACACTCTGTGCTCGCTGGGGCAGGTGGCCCACCGGGGCATGG 

AGAAGAGCCTCT7TG7C77CACCCAGGAGGAGG7GCAGGCC7CCGGGC7GCAGGAGAGA 

GACA7GCAGCTGGGC77CC7GCGGGC777GCCGGAGC7GGGCCCCGGGGG7GACCAGCA 

G7CG7A7GAG777T7CCACC7GAGCC7CCTCACC7G7AAAACTGGGA7CCCAG7A7AGA 

G777GGAAA7CAG7AGACACCA7A7GC77CAAAAAACAGGGGC7A77AAAA7GACA7CA 

GGAGCCAGAAAGTCTCATGGCTGTGCTTTCTCTTGAAGTTTATACAACAACCAGATCAC 

CGA7G7CGGAGCCAGAC7GGGAAAAAACAAAA7AACAAG7GAAGGAGGGAAG7A7C7GG 

CGC7GGC7G7GAAGAACAGCAAA7CAA7C7C7GAGG77GGGA7G7GGGGCAA7CAAG77 

GGGGA7GAAGGAGCAAAAGCC77CGCAGAGGC7C7GCGGAACCACCCCAGC77GACCAC 

CC7GAG7C7TGCGTCCAACGGCATCTCCACAGAAGGAGGAAAGAGCCTTGCGAGGGCCC 

TGCAGCAGAACACG7C7C7AGAAA7AC7G7GGC7GACCCAAAA7GAAC7CAACGATGAA 

G7GGCAGAGAG777GGCAGAAA7G7TGAAAG7CAACCAGACG77AAAGCAT77A7GGC7 

7A7CCAGAA7CAGA7CACAG7CTTTTG7G7CAG7G7C77AAAGGGGCCTGCGCAGGCGG 

GAC7ATCAGGAGTCCACTGCC7CCATGATGCAAGCCAGCTTCCTGTGCAGAAGGTC7GG 

7CGGCAAAC7CCC7AAG7ACCCGC7ACAAT7C7GCAGAAAAAGAA7G7G7C77GCGAGC 

7G77G7AG77ACAG7AAA7ACAC7G7GAAGAGAC777A77GCC7A77A7AA77A7T777 

A7C7GAAGC7AGAGGAATAAAGCTGTGAGCAAACAGAGGAGGCCAGCCTCACCTCATTC 

CAACACC7GCCA7AGGGACCAACGGGAGCGAG77GG7CACCGC7C7777CA77GAAGAG 

77GAGGATGTGGCACAAAGT7GGTGCCAAGCTTCTTGAATAAAACGTGTTTGATGGATT 

AG7A77A7ACC7GAAA7A7777C77CC77C7CAGCAC777CCCA7G7A77GA7AC7GG7 

CCCAC77CACAGC7GGAGACACCGGAG7A7G7GCAG7G7GGGA777GAC7CC7CCAAGG 

T777G7GGAAAG77AA7G7CAAGGAAAGGA7GCACCACGGGCT777AA7777AA7CC7G 

GAG7C7CAC7G7C7GC7GGCAAAGA7AGAGAA7GCCC7CAGC7C77AGC7GG7C7AAGA 

ATGACGATGCCTTCAAAATGC7GCTTCCACTCAGGGCTTCTCCTC7GCTAGGCTACCCT 

CCTCTAGAAGGCTGAGTACCATGGGCTACAGTGTCTGGCCT7GGGAAGAAGTGATTCTG 

7CCC7CCAAAGAAA7AGGGCA7GGC77GCCCC7G7GGCCC7GGCA7CCAAA7GGC7GC7 

T77G7C7CGC77ACC7CG7GAAGAGGGGAAG7C7G77CG7GCC7CCCAAGCAGC7GAAG 

GG7GAC7AAACGGGCGCCAAGAC7CAGGGGA7CGGC7GGGAAC7GGGCCAGCAGAGCA7 

G77GGACACCCCCCACCA7GG7GGGC77G7GG7GGC7GC7CCA7GAGGG7GGGGG7GA7 

AC7AC7AGA7CAC77GTCC7C77GCCAGCTCA777G77AA7AAAA7AC7GAAAACACAA 

. L -AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAA (SEQ ID NO: 25) 
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rl^SDLLKNDYFSAEDAEIVCACPTQFDKVRKILDLVQSKGEZVSEFFLYLL 
QQLADAYVDLRPWLLEIGFSPSLLTQSKVVVNTDPVSRYTCQLRKHLGRDS 
KF VLC YAQ KEELLLEE I YMDT I MEL VG F SNES LG S LNS LACLLDHTTG I LN 
EQGETIFILGDAGVGKSMLLQRLQSLWATGRLDAGVKFFFHFRCRMFSCFK 
ESDRLCLQDLLFKHYCYPERDPEEVFAFLLRFPHVAIiFTFDGLDELKSDLD 
LSRVPDSSCPWEPAHPLVLIoANLLSGKLLKGASKLLTARTGIEVPRQFLRK 
KVLL.RGFSPSHLRAYARRMFPERALQDRLLSQLEANPNLCSLCSVPLFCWI 
I FRCFQHFRAAFEGS PQLPDCTMTLTDVFLLVTEVHLNRMQPSSLVQRNTR 
5PVETLHAGRDTLCSLGQVAHRGMEKSLFVFTQEEVQASGLQERDMQLGFL 
RALPELGPGGDQQS YEFFHLSLLTCKTGI PV (SEQ ID NO: 26) 
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CCCGCGTCCGCGTCCCCGGACCATGGCGCTCTCCGGGCTCTTCTCTAGCTCTCAGCGGCT 
GCGAAGTCTGTNAACCTGGTGGCCAAGTGATTGTAAGTCAGGAGACTTTCCTTCGGTTTC 
TGCCTTTGATGGCAAGAGGTGGAGATTGTGGCGGCGATTACAGAAAACATCTGGGAAGAC 
AAGTTGCTGTTTTTATGGGAATCGCAGGCTTGGAAGAGACAGAAGCAATTCCAGAAATAA 
ATTGGAAATTGAAGATTTAAACAATGTTGTTTTAAAATATTCTAACTTCAAAGAATGATG 
CCAGAAACTTAAAAAGGGGCTGCGCAGAGTAGCAGGGGCCCTGGAGGGCGCGGCCTGAAT 
CCTGATTGCCCTTCTGCTGAGAGGACACACGCAGCTGAAGATGAATTTGGGAAAAGTAGC 
CGCTTGCTACTTTAACTATGGAAGAGCAGGGCCACAGTGAGATGGAAATAATCCCATCAG 
AGTCTCACCCCCACATTGAATTACTGAAAAGCAATCGGGAACTTCTGGTCACTCACATCC 
GCAATACTCAGTGTCTGGTGGACAACTTGCTGAAGAATGACTACTTCTCGGCCGAAGATG 
CGGAGATTGTGTGTGCCTGCCCCACCCAGCCTGACAAGGTCCGCAAAATTCTGGACCTGG 
TACAGAGCAAGGGCGAGGAGGTGTCCGAGTTCTTCCTCTACTTGCTCCAGCAACTCGCAG 
ATGCCTACGTGGACCTCAGGCCTTGGCTGCTGGAGATCGGCTTCTCCCCTTCCCTGCTCA 
CTCAGAGCAAAGTCGTGGTCAACACTGACCCAGTGAGCAGGTATACCCAGCAGCTGCGAC 
ACCATCTGGGCCGTGACTCCAAGTTCGTGCTGTGCTATGCCCAGAAGGAGGAGCTGCTGC 
TGGAGGAGATCTACATGGACACCATCATGGAGCTGGTTGGCTTCAGCAATGAGAGCCTGG 
GCAGCCTGAACAGCCTGGCCTGCCTCCTGGACCACACCACCGGCATCCTCAATGAGCAGG 
CTGCTTCAAGGAAAGTGACAGGCTGTGTCTGCAGGACCTGCTCTTCAAGCACTACTGCTA 
CCCAGAGCGGGACCCCGAGGAGGTGTTTGCCTTCCTGCTGCGCTTCCCCCACGTGGCCCT 
CTTCACCTTCGATGGCCTGGACGAGCTGCACTCGGACTTGGACCTGAGCCGCGTGCCTGA 
CAGCTCCTGCCCCTGGGAGCCTGCCCACCCCCTGGTCTTGCTGGCCAACCTGCTCAGTGG 
GAAGCTGCTCAAGGGGGCTAGCAAGCTGCTCACAGCCCGCACAGGCATCGAGGTCCCGCG 
CCAGTTCCTGCGGAAGAAGGTGCTTCTCCGGGGCTTCTCCCCCAGCCACCTGCGGGCCTA 
TGCCAGGAGGATGTTCCCCGAGCGGGCCCTGCAGGACCGCCTGCTGAGCCAGCTGGAGGC 
CAACCCCAACCTCTGCAGCCTGTGCTCTGTGCCCCTCTTCTGCTGGATCATCTTCCGGTG 
CTTCCAGCACTTCCGTGCTGCCTTTGAAGGCTCACCACAGCTGCCCGACTGCACGATGAC 
CCTGACAGATGTCTTCCTCCTGGTCACTGAGGTCCATCTGAACAGGATGCAGCCCAGCAG 
CCTGGTGCAGCGGAACACACGCAGCCCAGTGGAGACCCTCCACGCCGGCCGGGACACTCT 
GTGCTCGCTGGGGCAGGTGGCCCACCGGGGCATGGAGAAGAGCCTCTTTGTCTTCACCCA 
GGAGGAGGTGCAGGCCTCCGGGCTGCAGGAGAGAGACATGCAGCTGGGCTTCCTGCGGGC 
TTTGCCGGAGCTGGGCCCCGGGGGTGACCAGCAGTCCTATGAGTTTTTCCACCTCACCCT 
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CCAGGCCTTCTTTACAGCCTTCTTCCTCGTGCTGGACGACAGGGTGGGCACTCAGGAGCT 
GCTCAGGTTCTTCCAGGAGTGGATGCCCCCTGCGGGGGCAGCGACCACGTCCTGCTATCC 
TCCCTTCCTCCCGTTCCAGTGCCTGCAGGGCAGTGGTCCGGCGCGGGAAGACCTCTTCAA 
GAACAAGGATCACTTCCAGTTCACCAACCTCTTCCTGTGCGGGCTGTTGKCCAAAGCCAA 
ACAGAAACTCCTGCGGCATCTGGTGCCCGCGGCAGCCCTGAGGAGAAAGCGCAAGGCCCT 
GTGGGCACACCTGTTTTCCAGCCTGCGGGGCTACCTGAAGAGCCTGCCCCGCGTTCAGGT 
CGAAAGCTTCAACCAGGTGCAGGCCATGCCCACGTTCATCTGGATGCTGCGCTGCATCTA 
CGAGACACAGAGCCAGAAGGTGGGGCAGCTGGCGGCCAGGGGCATCTGCGCCAACTACCT 
CAAGCTGACCTACTGCAACGCCTGCTCGGCCGACTGCAGCGCCCTCTCCTTCGTCCTGCA 
TCACTTCCCCAAGCGGCTGGCCCTAGACCTAGACAACAACAATCTCAACGACTACGGCGT 
GCGGGAGCTGCAGCCCTGCTTCAGCCGCCTCACTGTTCTCAGACTCAGCGTAAACCAGAT 
CACTGACGGTGGGGTAAAGGTGCTAAGCGAAGAGCTGACCAAATACAAAATTGTGACCTA 
T TTGGGTTTAT A C AAC AAC C AG AT C AC C G AT GTCGG AG CCAGGTACGTC AC CAAAATCCT 
GGATGAATGCAAAGGCCTCACGCATCTTAAACTGGGAAAAAACAAAATAACAAGTGAAGG 
AGGGAAGTATCTCGCCCTGGCTGTGAAGAACAGCAAATCAATCTCTGAGGTTGGGATGTG 
GGGCAATCAAGTTGGGGATGAAGGAGCAAAAGCCTTCGCAGAGGCTCTGCGGAACCACCC 
CAGCTTGACCACCCTGAGTCTTGCGTCCAACGGCATCTCCACAGAAGGAGGAAAGAGCCT 
TGCGAGGGCCCTGCAGCAGAACACGTCTCTAGAAATACTGTGGCTGACCCAAAATGAACT 
CAACGATGAAGTGGCAGAGAGTTTGGCAGAAATGTTGAAAGTCAACCAGACGTTAAAGCA 
TTTATGGCTTATCCAGAATCASATCACAGCTWARGGGACTGCCCAGCTGGCAGATGCGTT 
ACA.GAGCAACACTGGCATAACAGAGATTTGCCTAAATGGAAACCTGATAAAACCAGAGGA 
GGCCAAAGTCTATGAAGATGAGAAGCGGATTATCTGTTTCTGAGAGGATGCTTTCCTGTT 
CATGGGGTTTTTGCCCTGGAGCCTCAGCAGCAAATGCCACTYTGGGCAGTCTTTTGTGTC 
AGTGTCTTAAAGGGGCCTGCGCAGGCGGGACTATCAGGAGTCCACTGCCTCCATGATGCA 
AGCCAGCTTCCTGTGCAGAAGGTCTGGTCGGCAAACTCCCTAAGTACCCGCTACAATTCT 
GCAGAAAAAGAATGTGTCTTGCGAGCTGTTGTAGTTACAGTAAATACACTGTGAAGAGAC 
TTTATTGCCTATTATAATTATTTTTATCTGAAGCTAGAGGAATAAAGCTGTGAGCAAACA 
GAGGAGGCCAGCCTCACCTCATTCCAACACCTGCCATAGGGACCAACGGGAGCGAGTTGG 
TCACCGCTCTTTTCATTGAAGAGTTGAGGATGTGGCACAAA.GTTGGTGCCAAGCTTCTTG 
AATAAAACGTGTTTGATGGATTAGTATTATACCTGAAATATTTTCTTCCTTCTCAGCACT 
TTCCCATGTATTGATACTGGTCCCACTTCACAGCTGGAGACACCGGAGTATGTGCAGTGT 
GGGATTTGACTCCTCCAAGGTTTTGTGGAAAGTTAATGTCAAGGAAAGGATGCACCACGG 
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GCTTTTAATTTTAATCCTGGAGTCTCTACTGTCTGCTGGCAAAGATAGAGAATGCCCTCAG 
CTCTTAGCTGGTCTAAGAATGACGATGCCTTCAAAATGCTGCTTCCACTCAGGGCTTCTC 
CTCTGCTAGGCTACCCTCCTCTAGAAGGCTGAGTACCATGGGCTACAGTGTCTGGCCTTG 
GGAAGAAGTGATTCTGTCCCTCCAAAGAAATAGGGCATGGCTTGCCCCTGTGGCCCTGGC 
ATCCAAATGGCTGCTTTTGTCTCCCTTACCTCGTGAAGAGGGGAAGTCTCTTCCTGCCTC 
CCAAGCAGCTGAAGGGTGACTAAACGGGCGCCAAGACTCAGGGGATCGGCTGGGAACTGG 
GCCAGCAGAGCATGTTGGACACCCCCCACCATGGTGGGCTTGTGGTGGCTGCTCCATGAG 
GGTGGGGGTGATACTACTAGATCACTTGTCCTCTTGCCAGCTCATTTGTTAATAAAATAC 
TGAAAACCCAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGCGG ( SEQ ID NO : 38 ) 
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MEEQGHSEMEIIFSESKPHIQLLKSNRELLVTHIRNTQCLVDNLLKNDYFSAEDAEIVCA 
CPTQPDKVRKILDLVQSKGEEVSEFFLYLLQQLADAYVDLRPWLLEIGFSPSLLTQSKVV 
^/NTDPVSRYTQQLRHHLGRDSKFVLCYAQKEELLLEEIYMDTIMELVGFSNESLGSLNSL 
ACLLDHTTGII^EQAASRKVTGCVCRTCSSSTTATQSGTPRRCLPSCCASPTWPSSPSMA 
WTSCTRTWT (SEQ ID NO : 39 ) 
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CACGCGTCCGCGCTACTGCGGGAGCAGCGTCCTCCCGGGCCACGGCGCTTCCCGGCCCCG 
GCGTCCCCGGACCATGGCGCTCTCCGGGCTCTTCTCTAGCTCTCAGCGGCTGCGAAGTCT 
GTAAACCTGGTGGCCAAGTGATTGTAAGTCAGGAGACTTTCCTTCGGTTTCTGCCTTTGA 
TGGCAAGAGGTGGAGATTGTGGCGGCGATTACAGAAAACATCTGGGAAGACAAGTTGCTG 
TTTTTATGGGAAT.CGCAGGCTTGGAAGAGACAGAAGCAATTCCAGAAATAAATTGGAAAT 
TGAAGATTTAAACAATGTTGTTTTAAAATATTCTAACTTCAAAGAATGATGCCAGAAACT 
TAAAAAGGGGCTGCGCAGAGTAGCAGGGGCCCTGGAGGGCGCGGCCTGAATCCTGATTGC 
CCTTCTGCTGAGAGGACACACGCAGCTGAAGATGAATTTGGGAAAAGTAGCCGCTTGCTA 
CTTTAACTATGGAAGAGCAGGGCCACAGTGAGATGGAAATAATCCCATCAGAGTCTCACC 
CCCACATTCAATTACTGAAAAGCAATCGGGAACTTCTGGTCACTCACATCCGCAATACTC 
AGTGTCTGGTGGACAACTTGCTGAAGAATGACTACTTCTCGGCCGAAGATGCGGAGATTG 
TGTGTGCCTGCCC'CACCCAGCCTGACAAGGTCCGCAAAATTCTGGACCTGGTACAGAGCA 
AGGGCGAGGAGGTGTCCGAGTTCTTCCTCTACTTGCTCCAGCAACTCGCAGATGCCTACG 
TGGACCTCAGGCCTTGGCTGCTGGAGATCGGCTTCTCCCCTTCCCTGCTCACTCAGAGCA 
AAGTCGTGGTCAACACTGACCCAGGTAGGAGTCAGCCCCAGCAAGACCGCAGGCACCAGT 
GCAAGCAGGGCCCTGGGGGGTTTGGTAATGGCTGGGCCAGCCCTGAGTGCCACCTCAGGA 
AGCAGGCCCAGGTGCTATTTTGATTTTAGAAAGGAACAGCTGAATCCTGTCTCCCAAGTG 
CAGCCCAGGTGGCTGCGATTGAACTGCCCACA'CCTCGATGGTCTGGTTTATAGAGGGGCC 
TTTGGAAGTATGGGAATGGCCTGTGTTCTGACCCCTTGCTTTCTTCCTATTCTGACATAT 
GTAGACATTTTAATGGTTGCACAAATTCAAGGTTGTATTTTTTTTTCTTTAAAAAAATCT 
TTAGCTGGACATGGTAGCACACACCTGTAGTTCCAGCTACTCAGGAGGCTGAGGCAAGAG 
GACTGCTTGAGCCCCAGAGTCTAAGGCTGCAGCGAGCTATGATTGTGCCCCTACACTCCA 
CAGCCTGGGTTTTAGAGTGAGACCCTGTCTCTAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAANGGGCGG (SEQ ID NO:4q) 



FIG. 12 



15 / 35 



WO 99/40102 PCT/US99/02544 



MEEQGHSEMEIIPSESKFHIQLLKSNRELLVTHIRNTQCLVDNLLKNDYFSAEDAEIVCA 
CPTQPDKVRKILDLVQSKGEEVSEFFLYLLQQljADAyVDLRPWLLEIGFSPSLLTQSKVV 
^/NTDPGRSQPQQDRRHQCKQGPGGFGNGWASPECHLRKQAQVLF 
(SEQ ID NO: 41 ) 
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CCACGCGTCCGCGGACCGCGAGCGGTAGCGCCCTCCCTCCCAGCTGTTGTCCCGCCCGAT 
CCGCGACCCTAGTCCCCGGATCCCCTTGCTGAGAGTCACCGTACTCCAGGGCCAACTGAG 
CCAAAGTCCTGCCAACTTGGGTCAGGAATGAAAGGCAGGATCCTGGGTGGTGGCCCTGAA 
TGCTGATTTGTCTGCCCTGCCAGGGAGACACATGTGGTCAAAGATGAATTTGAGAAAAGT 
AGCTGCTGGCTACTTGAACAATGGAGGAACACGGCCATCATGAGATGGAAGGCACCCCAT 
TGGGTTGTCACTCCCACATTAAACTGCTGAAGATCAACAGGGAACATCTGGTCACCAACA 
TTCGGAACACTCAGTGTCTGGTGGACAACTTGCTGGAGAATGGCTACTTGTCAGCCGAAG 
ATGCAGAGATTGTGTGTGCCTGTCCCACCAAGCCTGACAAGGTCCGAAAGATCCTTGACC 
TGGTGCAGAGCAAAGGCGAGGAGGTGTCTGAGTTCTTCCTCTACGTGCTGCAGCAGCTGG 
AGGATGCTTACGTGGACCTCAGGCTGTGGCTCTCAGAAATTGGCTTCTCCCCTTCCCAGC 
TCATTCGGACCAAAACTATCGTCAATACTGACCCAGTAAGCAGGTATACCCAACAGCTGC 
GACACCAACTGGGCCGCGACTCGAAGTTCATGCTGTGCTACGCCCAGAAGGAGGACCTGC 
TGCTGGAGGAGACCTATATGGACACACTCATGGGGCTGGTAGGCTTCAACAATGAAAACC 
TGGGCAGCCTAGGAGGCCTGGATTGCCTGCTGGACCACAGTACGGGCGTCCTCAACGAGC 
ATGGCGAGACTGTCTTCGTGTTCGGGGACGCGGGAGTGGGCAAGTCCATGCTGCTGCAGA 
GGTTGCAGAGCCTCTGGGCGTGAGGCAGGTTGACCTCCACAGCCAAATTCTTCTTCCACT 
TCCGCTGCCGCATGTTCAGCTGCTTCAAGGAGAGCGACATGCTGAGTCTGCAGGACCTGC 
TCTTCAAGCATTTCTGCTACCCGGAGCAGGACCCCGAGGAGGTGTTCTCCTTCTTGCTGC 
GCTTTCCCCACACAGCGCTCTTCACTTTTGACGGCCTGGATGAGCTGCACTCAGACTTCG 
ACCTGAGCCGCGTGCCGGATAGCTGCTGCCCCTGGGAGCCGGCTCACCCTCTGGTCCTGC 
TGGCTAACCTCCTAAGTGGGAGGCTGCTCAAGGGTGCCGGCAAATTGCTCACTGGTCGCA 
CAGGCGTGGAGGTCCCCCGCCAGCTCCTGGGCAAAAAGGTGCTGCTCCGGGGCTTCTCCC 
CAAGTCACCTGCGCGCCTATGCCCGCCGGATGTTCCCCGAGCGCACAGCGCAGGAGCATC 
TGCTGCAGCAGCTGGATGCCAACGCCAACCTCTGCAGCCTGTGCGGGGTGCCGCTCTTCT 
GTTGGATCATCTTCCGTTGTTTCCAGCACTTCCAGACGGTCTTCGAGGGCTCCTCTTCAC 
AGTTGCCGGACTGTGCTGTGACCCTGACGGATGTCTTTCTGCTGGTCACTGAGGTGCATC 
TGAACAGGCCGCAGCCCAGCAGCCTGGTGCAGCGCAACACGCGCAGCCCGGCGGAAACCC 
TACGTGCAGGCTGGCGCACGCTGCATGCGCTGGGAGAGGTGGCTCACCGAGGGACCGACA 
AGAGCCTCTTTGTGTTTGGCCAGGAGGAGGTGCAGGCGTCGAAGCTGCAGGAAGGAGATC 
TGCAGCTGGGCTTCCTGGGGGCTTTGCCCGATGTGGGCCCTGAGCAGGGCCAGTCTTACG 
AATTTTTCCACCTTACGCTCCAGGGGTTCTTCACCGCCTTCTTCCTGGTAGCAGATGACA 
AAGTGAGCACCCGGGAGTTGCTGAGGTTCTTTCGAGAATGGACGTCTCCTGGAGAGGCAA 
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CAAGCTCGTCCTGCCATTCTTCCTTCTTCTCCTTCCAGTGCCTGGGCGGCAGAAGCCGGT 
TGGGCCCTGATCCTTTCAGGAACAAAGATCACTTCCAGTTCACCAACCTCTTCGTGTGCG 
GGCTACTGGCCAAAGCCCGACAGAAACTCCTTCGGGAGCTGGTGCCCAAGGCTATCCTGA 
GGAGGAAGCGCAAGGCCCTGTGGGCTCACCTGTTTGCTAGCCTGCGCTCCTACTTGAAGA 
GCCTACCTCGGGTCCAGTCTGGAGGCTTTAACCAGGTGCATGCCATGCCCACATTCCTGT 
GGATGCTGCGCTGCATCTATGAGACGCAGAGCCAGAAGGTGGGGCGCCTCGCCGCGAGGG 
GCATCAGTGCGGACTACCTCAAGCTGGCCTTTTGCAACGCTTGCTGTGCGGACTGCAGCG 
CCCTGTCCTTCGTCCTGCATCACTTCCACAGGCAGCTGGCCCTAGACCTGGACAACAACA 
ACCTCAATGACTATGGCGTGCAGGAGCTGCAGCCTTGCTTTAGCCGTCTCACGGTTATCA 
GACTCAGCGTCAACCAGATCACCGACACGGGGGTGAAGGTGCTATGTGAGGAACTGACCA 
AGTATAAGATCGTGACGTTCCTGGGTTTATACAACAACCAGATAACTGATATCGGAGCCA 
GGTATGTGGCCCAAATCCTGGATGAATGCAGAGGCCTCAAGCACCTTAAACTAGGGAAAA 
ACAGAATAACAAGTGAGGGCGGGAAGTGTGTGGCTTTGGCTGTGAAGAACAGCACCTCCA 
TCGTTGATGTTGGGATGTGGGGTAATCAGATTGGAGACGAAGGGGCAAAGGCCTTCGCAG 
AGGCATTGAAGGACCACCCCAGCCTGACCACTCTCAGTCTTGCATTCAATGGCATCTCTC 
CGGAGGGAGGGAAGAGCCTTGCGCAGGCCCTGAAGCAGAACACCACACTGACAGTAATCT 
GGCTGACCAAAAATGAACTTAATGATGAGTCTGCAGAGTGCTTCGCTGAGATGCTGAGAG 
TGAAC CAGACG CTACGGCATTTATGGCTGATCCAGAATCGCATCACAGCCAAGGGGACAG 
CGCAGCTGGCGAGGGCACTGCAGAAGAACACAGCCATAACAGAGATTTGTCTCAATGGAA 
ACTTGATTAAGCCCGAGGAGGCCAAAGTCTTCGAGAATGAGAAGAGAATCATCTGCTTCT 
GACGGACGCTCCTGGGCAGGATCTTTGTCCTAGGTTGCTCCTCAGTCACAGACAGCACTG 
TGCAGTCAGCAGGGTAGCAGGATGCTGTGCAGCGCCTGCAGCAAGGTGCCTGTCAGGAGC 
CCACACCTCCACAGTGCACACCGATGTCCCCTGCTCATGCTTGGACTGGTAGCACCCGCG 
CCGCGGCTGAGACCCTGCAGACGCAGGGAGTCTTAGGAACCATCGTCACCACTCAAAGCC 
AGCAGGGCATCTTCTGTACAAAGATCTCCCTGCATATCCACTAGACGGAAGCTGAAGGAA 
CGCAACAGCAGAGGAGGCCAACAGACGCCTGGCTGAAGGCTCCGTGGGACCAACGGTGTC 
ACCTTCAGAAAAGAGCTGGGAACTTGAGCAGAGCCGATGGTAACTTCTTGGGGAAAGAAG 
GCACCCAGTGACTGCATGGTTATTCTGAGTCCTCCTTCCTCTGCTTAGTCCCTCTCACTG 
TACAGGTCTGTTTCTTCCTCGCAGCTGTGGCTGCTGAAGTAGGTCCACTGTGGGGAGAGC 
TCATCACAGACTTTGGTTCGGTTCTGGATTCTCAGTGGTGGCAACCGAGAGTCAGACGAT 
ACCCTCTAGGTCAGTCTCAGAGGATCTCTATGCTGTGAGAGGGTTGAGGGCCCACCCAGA 
ATTTTTTTTTTTTACCAGTTTTTACTGTGCCTGCCCCAGGAGGGAGAATTACTTCCCAGC 
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CTCCACAGCAGCAGGCATGGCTTGCCTCAATGGTCCTGAGATCCCAACAAAACTCTCTCC 

CTTGCCTGTGAGCAGAAAGTATCTTCATGTCCTCAGAAGTTGGAGGGTGACTGGACACAG 

TTAAGACTCAGAGAGCCAGCTGATAGCTCAAAGCAAAGCATGGCACATACCCACCACCAT 

ACCATGGTGCGCATGGGATGGGACAGTTGGAATGTTGCAGATAACGTGTTCTTTTGCCAG 

TTCATTTGTTAATAAAATATTTAAAACGTTAAAAAAAAAAAAAAAAAAAAAAAA 

G (SEQ ID NO: 42 ) 
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MEEHGKKEMEGTPLGCHSKIKLLKINREHLVTNIRNTQCLVDNLLENGYFSAEDAEIVCA 
CPTKPDKVRKILDLVQSKGEEVSEFFLYVLQQLEDA^TDLRLWLSEIGFSPSQLIRTKTI 
WTDPVSRYTQQLRHQLGRDSKFMLCYAQKEDLLLEETYMDTLMGLVGFNNENLGSLGGL 
DCLLDHSTGVLiNEHGETVFVFGDAGVGKSMLLQRLQSLWASGRLTSTAKFFFHFRCRMFS 
CFKESDMLSLQDLLFKHFCYPEQDPEEVFSFLLRFPHTALFTFDGLDELHSDFDLSRVPD 
SCCPWEPAHPLVLIiANLLSGRLLKGAGKLLTARTGVEVPRQLLRKXVLLRGFSPSHLRA^ 
ARRMFPERTAQEHLLQQLDANPNLCSLCGVPLFCWIIFRCFQHFQTVFEGSSSQLPDCAV 
TLTDVFLLVTEVHLNRPQPSSLVQRNTRSPAETLRAGWRTLKALGEVAHRGTDKSLFVFG 
QEEVQASKLQEGDLQLGFLRALPDVGFEQGQSYEFFKLTLQAFFTAFFLVADDKVSTREL 
LRFFREWTSPGEATSSSCHSSFFSFQCLGGRSRLGPDPFRNKDHFQFTNLFVCGLLAKAR 
QKLLRQLVPKAILRRKRKALWAHLFASLRSYLKSLPRVQSGGFNQVHAMPTFLWMLRCIY 
ETQSQKVGRLAARG I S ADYLKLAFCNACS ADCSALS FVLHHFHRQLALDLDNNNLNDYGV 

QELQPCFSRLTVIRLSVNQITDTGVKVLCEELTKYKIVTFLGLYNNQITDIGARYVAQIL 
DECRGLKHLKLGKNR I TS EGGKCVALAVKNS TS I VDVGMWGNQ I GDEGAKAFAEALKDHP 

SLTTLSLJVFNGISPEGGKSLAQALKQNTTLTVIWLTXNELNDESAECFAEMLRVNQTLRH 
LWLI QNRI TAKGTAQLARALQKNTAI TE I CLNGNL I KP E EAKVFENEKR 1 1 CF 
(SEQ ID NO: 43) 
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gatcatcgttcactgcagccttgaactcttgtgctcatgtgatcctcctgccttagcctccccaa 
tagctgggaccacaggtgcgccaccatgcctggctaattttttttatttttgtagagatgggtgt 
ctcactatgttgcacaggttggtctcaaactactggccttacttcaagctatctacccatctcag 
cctcccaaagcgctgggattacagtcatgagccaacttgcctggccagataaaggtcttaagcat 
ggttccttcctgctctaggtagagaaaccccacaaccagtgggaggtggggtgagctctttctgt 
agcttrtgccttgctgatgatgtcattgatctcttcaggggctgcgcagagtagcaggggccctg 
g^gggcgcggcctgaatcctgattgcccttctgctgagaggacacacgcagctgaagatgaattt 
gggaaaagtagccgcttgctactttaactatggaagagcagggccacagtgagatggaaataatc 
ccatcagagtctcacccccacattcaattactgaaaagcaatcgggaacttctggtcactcacat 
ccgcaatactcagtgtctggtggacaacttgctgaagaatgactacttctcggccgaagatgcgg 
agattgtgtgtgcctgccccacccagcctgacaaggtgccccggggacagggacgggcatggcat 
tgtgtggaccccgggagctagaagaggcccctccctgctgatctgagtgaagagcgtgggagttt 
agtccagcgggcagggctgcattttggggtactaatagcacacaaatgcctgggttagcaggttg 
cacagtcaggtattttacttctgtgtttgtgtctggagcaaaccctgacatctcagttctcattg 
ctgtgtgtattggttcccagacacttcatttttagatcccctttaaattaggagggaaaaagaac 
ataagcataagagcatccccagcagcgatgttcattcagtgcctctgaaggctggagggctgctt 
gttgctgggtgagactcggaggggaaccgactcagggtcaggaatgatgacatcccacggtgggt 
ccacagtgaagaatcttccccgctccactgtgggacgccttaacagcccttacttccacttacgc 
tttgcgttatctcctgaaaaataaaatggagaccacaaattccttcttggttagaggaatgacac 
aactcatttatgacatgaccccgctgggactcagaagagaccaggacggtttctgggggaagcag 
tagcacactcgtgtgctttgttctcttctcttgatttgttttcccacatttttaacaagaaaaaa 
agccgtttttaatatatggcctatcgccctcctactgtgtggcccaggtgcctacctcattatgc 
ccaaggggtggttctcacctctccactctcattcctgcacagcagttgtgtcaggttaagaggga 
caaggagaaggctgggcaccgtggctcacgcctgtaatcccagcactttgggaggccgaggcagg 
cagatcacctaaggtcaggagtttgagaccagcctggccaacatggggaaaacccgtctctaata 
aaaacacaaaaattagtcgggcatggtggtgggtgcctgtaatcccagccacttgggaggctgag 
gaaagagaattccttgaacctgggaggtggaggttgcagtgagccaagattgtgccattgcactc 
cagccctccagcctgggtgacagagcaagactctgtctcaaaaaagaaaaaaaaaaaaaagaggt 
agagaagtccatggctatttgtctgtcctttttatttttaggctcatggaagcctcctggtttct 
tagagctgagtggttttatttcttgctcaggaggtcatttcacagattttcgggctccaatatgt 
tgactgtcacagcagctggggggatggcatagctaccggctgtactaagaactcagagccctgcc 
ctgagcctgcctgagggtccttatggtaggaggatgcccctcatgccagcccgtgccctcatgct 
tgtgtcacctccaggtccgcaaaattctggacctggtacagagcaagggcgaggaggtgtccgag 
ttcttcctctacttgctccagcaactcgcagatgcctacgtggacctcaggccttggctgctgga 
gatcggcttctccccttccctgctcactcagagcaaagtcgtggtcaacactgacccaggtagga 
gtcagccccagcaagaccgcaggcaccagtgcaagcagggccctggggggtttggtaatggctgg 
gccagccctgagtgccacctcaggaagcaggcccaggtgctattttgattttagaaaggaacagc 
tgaatcctgtctcccaagtgcagcccaggtggctgcgattgaactgcccacacctcgatggtctg 
gtttatagaggggcctttggaagtatgggaatggcctgtgttctgaccccttgctttcttcctat 
tctgacatatgtagacattttaatggttgcacaaattcaaggttgtatttttttttcttttaaaa 
aaatctttagctggacatggtagcacacacctgtagttccagctactcaggaggctgaggcaaga 
ggactgcttgagccccagagtctaaggctgcagcgagctatgattgtgcccctacactccagcct 
gggtg a cagagtgagaccctgtctctaaaaaaggaaagaaaaaaattaaaaagccttgccaggtt 
tgattctaggcaaagtattctgtcaccgttgagtgccagtccttatttccaaactaatggaagac 
cccatcagttaactgattagttcaataagtattttttgctgtatccaccacatgccaagacccta 
cactgtgctggatgtcagggagacagtggtgagcagacacagacagggttcctgccctcagggag 
cttcaagtcagctggaagagaccaccagtcagcaatctcaaaaatgtgtcaggacagcggcagtc 
caaggcatgtgagaacatatcattagggccaggatctgctctggggcaggagtcttctttccctg 
cttttgaactctccactttgagacagctgttggtaacataccagcaccaaggacctaagtcctgc 
cttttaaagaatccaatatgttgttggaaacagaagcacaagacaggtgtgtgcttaggggaaac 
aaggccagccggcagagtgtcagtgctaggctccagcttccacagcccctgcaggtgcctgccag 
ccactgctagcttctgactctgtctgctccttcctgtctccccttgtttccttcccccatgaaaa 
aaaaagaaagtattcccatgaggaatcattctttcgaaagacttctctgttggttccgttagcca 
gctactttactagcttttacagtgtaattcactctacaagcagtctcacacaaaagactacatat 
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tgtatgattctgtttatatgaaatgtccagaaaaggtaaatctatagacaaagcaaatcagtagt 
tgcctacggcccagggattggctacaaataggctccagaaaactctgggaagatggtagagatgt 
tctagacctggactgtggtgaggtttgcacaactttgtaaacttactaaaaattactgacaaata 
tataacactccctaacactttgggaggccgaggtgggcagatcgcttgaacccaggaatttgaga 
ccagcctgggcaacatggcgagaccccgtctctacaaaaaaacacaaaaattagttgggcttggt 
ggcatatgcctgtgtcccagctacttgggaggctgaggtgggaggattgcttgagcctgggagtt 
tgagactgcatgattgggtcactgcaccctagcctgagtgacagagcaggaccctatctctaaca 
acaaaaaagcagtgttggtggaggagggccagcgtggccatctggcctggccctcgagtgcgagg 
ggcttcagtgtttagctgcagttcagtgatgacactgtgcggaggaataagggtggcctgtctca 
gacactgatcccagctgaagtttgtcaccttctttctggcaaatctgaggtcaagcagagagatc 
aaagcctggggccctcagggtcaggaatgctggctctgtgacgctccccaggtcctgcatctgag 
gagtggctgcgctggcctcagggcccaggttgtgaattttgtttatgcactcgcctctcctcttt 
gagacctccctgtttgatgctgtttctgcctctctcctcaccctgctgctgtgccctgccacccc 
ctccctccagtgagcaggtatacccagcagctgcgacaccatctgggccgtgactccaagttcgt 
gctgtgctatgcccagaaggaggagctgctgctggaggagatctacatggacaccatcatggagc 
tggttggcttcagcaatgagagcctgggcagcctgaacagcctggcctgcctcctggaccacacc 
accggcatcctcaatgagcagggtgagaccatcttcatcctgggtgatgctggggtgggcaagtc 
catgctgctacagcggctgcagagcctctgggccacgggccggctagacgcaggggtcaaattct 
tcttccactttcgctgccgcatgttcagctgcttcaaggaaagtgacaggctgtgtctgcaggac 
ctgctcttcaagcactactgctacccagagcgggaccccgaggaggtgtttgccttcctgctgcg 
cttcccccacgtggccctcttcaccttcgatggcctggacgagctgcactcggacttggacctga 
gccgcgtgcctgacagctcctgcccctgggagcctgcccaccccctggtcttgctggccaacctg 
ctcagtgggaagctgctcaagggggctagcaagctgctcacagcccgcacaggcatcgaggtccc 
gcgccagttcctgcggaagaaggtgcttctccggggcttctcccccagccacctgcgcgcctatg 
ccaggaggatgttccccgagcgggccctgcaggaccgcctgctgagccagctggaggccaacccc 
aacctctgcagcctgtgctctgtgcccctcttctgctggatcatcttccggtgcttccagcactt 
ccgtgctgcctttgaaggctcaccacagctgcccgactgcacgatgaccctgacagatgtcttcc 
tcctggtcactgaggtccatctgaacaggatgcagcccagcagcctggtgcagcggaacacacgc 
agcccagtggagaccctccacgccggccgggacactctgtgctcgctggggcaggtggcccaccg 
gggcatggagaagagcctctttgtcttcacccaggaggaggtgcaggcctccgggctgcaggaga 
gagacatgcagctgggcttcctgcgggctttgccggagctgggccccgggggtgaccagcagt cc 
tatgagtttttccacctcaccctccaggccttctttacagccttcttcctcgtgctggacgacag 
ggtgggcactcaggagctgctcaggttcttccaggagtggatgccccctgcgggggcagcgacca 
cgtcctgctatcctcccttcctcccgttccagtgcctgcagggcagtggtccggcgcgggaagac 
ctcttcaagaacaaggatcacttccagttcaccaacctcttcctgtgcgggctgttgtccaaagc 
caaacagaaactccttcggcatctggtgcccgcggcagccctgaggagaaagcgcaaggccctgt 
gggcacacctgttttccagcctgcggggctacctgaagagcctgccccgcgttcaggtcgaaagc 
ttcaaccaggtgcaggccatgcccacgttcatctggatgctgcgctgcatctacgagacacagag 
ccagaaggtggggcagctggcggccaggggcatctgcgccaactacctcaagctgacctactgca 
acgcctgctcggccgactgcagcgccctctccttcgtcctgcatcacttccccaagcggctggcc 
ctagacctagacaacaacaatctcaacgactacggcgtgcgggagctgcagccctgcttcagccg 
cctcactgttctcaggtgaggctgccaggcaaggggagcaacaggtgggccgggcgggccaggct 
cggagggcatcgggaatggcatcatggaccaggatcccccaggactcatgaccatggcccttgga 
atgtccagaccttttctttcttagcagggcagaggtcaaggtgcaaagcttcgaggcaggtggac 
ctggatcagccacagctgggtgcccttgaacaaagtgcttaactctcagagcctccacgccctca 
tctggaaaaagaagatgctcataatcctatcaattatggccacagggaccaatgttagttgagaa 
tgggtgaagtgcattacaaatattacctaatggaatgctctttacaaccctgtaacttaggtact 
gttattgtctctattttggcagataaggaagtagaggcacagagaagttaatagcttgctttagg 
tcacacagctcagacatagcagtgccagaatgcataaagaaccttccttttaagattaatgtaag 
gctccgagatagccctcaaaaagtttctggaatatgggagcttttattactgcagagaaagcaga 
ccttgtgccagttggcactggtgactttctgtgatcaacgctagcagcccttcacactgctagag 
acctcagttaaaatgctgactcgtggttgttttcctgttccatagtttacgggaaacagagccca 
gtctgttttcttctattagcatttcctatgtaaaataaaccttgtaaatctctacagggggttaa 
atttgccattacttgactcatgcatttctaaaaagcagtagggatttggaactgactcccagtgc 
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ctgtcacaccagtgtcagagtgtaaataattgcatggggacatggggtgcagggggtcgaaggct 

gccctagcctgggaattggaaaacctggagtctgttctctgtactctcagccagtgactctccct 

ctgtagccccaggcagtctcacactcagtgccaccctctgtccatcttttttttttctcccccaa 

atggagtcccgctctgttgcccaggctggagtgcagtggcgtgatctcagctcactgcaacctcc 

gcctcctgggttcaagcgattctcttgccccagcctcctgagtagctgggattacaggcacacgc 

caccatgtccggctaagttntttgtatttttagtaggacggggtttccccatgttggccaggctg 

gtcttgaaatcctgacctcaggtgatccgcccgcctcggccttccaaaatgttggggttacaggc 

atgagccgccgcacccgacccctctgtccatcttttcaatgggaaactccacaccagtgtggtgg 

ccctgcccttcctgctgtccccaggtgaagctttccttcacaccagtgcaagaaaaaacagcttg 

taggaaagcagaggatatgggtaaccacgggaagcacactcagttctctggctgcatcagttagg 

attagttttagctgagagcgaaaaccccaaatgttggtgagttacaagcttatttctctcatgta 

aaagtctagaggtaggtagttcaggactggtatggagtctccatgaccctccggagcccaggctc 

tcttctgccttcctgttctgccatcctcactacccggctttcccatcttggcccaagagggctgc 

tcaaactccagccatctagtcgacactctagctatcagtaagaaggaagggcaaagattgagagc 

atgcctcaatcttttaagaacacttcttggctattactaattatattgctgcttagatttcagaa 

cttaatggtatgggcagaatttaatgagatgggcccagctaaaagatgggggaatctattgctaa 

gaaagtatagatattgggaatgtctagcagcctgtgctgtcttgggctggccatgccatgtacat 

acacactatttcccagcaccaagctggggactctgagggaaagggtccagagtgtctgacttgat 

cattttgatgtggcctaaaaatcaagcttttaattgttcagccttttacttgttatcaaggtcag 

cttgtgggtctaattgggcccaaggcttgtgtttctaagtaaagttttattggaacgcagccata 

cccatttatttacttactggctgcttcacactacacagttgagtagctgtgacagagaccacatg 

gcccacagagcctaaaatatttgctgtctgacactttacagaatgacatgagcagtctcctttga 

cagtgggactcacagccttttccagtgacaaatcagggttagcccatgtgtttctggatgggggg 

aagctgttggcattttgggtataacagttcttgtgagacctgtccagcattttgcaggacaccta 

acatcattggccctgcctgcaagatgacagggcactccctcctccagtcacaaccactaaaagca 

gcccctgacatttccaaacccatgccctccaccatacgagaaccaggtacagggtctggctgaca 

cataggtcacacgcaaagggtggatgtcagaggtggctggcctcacacgtcctccctgtgtcctt 

cacggtcgtgtgaggagccaggggctgtgctgcagcctcgctcatgggctggtgcaggatgggtc 

tggcggccccacgttggccaggctttgtaaggggctatttggctgattgctgtggccattctcca 

ggggcgtctatacctgagaaaactccagggcctgaaggcttctggatctttgtaagattaatggt 

ccttcataatgagtgcctgccctgactcgtaatttttttgctgttttatttcagactcagcgtaa 

accagatcactgacggtggggtaaaggtgctaagcgaagagctgaccaaatacaaaattgtgacc 

tatttggggtatgtctttctccagaacactgggccaactacctagtaataatacagagctgcagg 

gaattcacattcccataggtccctggatgatcggcacggatggcccagggctgggaagagcgctg 

gcccaggagttgagagtcctgggttctctttgtggctcggccagtcatgaagtcttgctgagcct 

cagcctcctcacctgtaaaactgggatcccagtataggcaagtaggcttacaactggttattggg 

ggatgcaacgagaatataaggggatatatttaataaatgctagaatcctgtttacatattagtct 

ggactattttgggtccataatccctcatccagagcctttggggcaagacccgaatggggattctg 

agtgcatgctatggcatgacgtggccgcaggggtctaaggcagtgccccattttcaaacactttc 

atatttctcccgcagaatgtatgaaacagtcaaaccaagtgtggtaagaaagactataagtagct 

ccacatcagttgccaaaagaattgtgagaaactttgggcattcagagcctttgaggttttggagt 

ctgagagaagggattgcgggccagccccacacaactggtggctctgcaagctggagcagttgttc 

agtttcttggggcctcagtggccttcgatgttaatgaggacatggacgcaaacgaccccgggcca 

cactcggctccagggctctgtgtggctgtggaaccctggaagcctgagcttagctgcctttcaac 

ttccatctgctgtactattgaattggcattgagcggtgagatggctgaaaggtagacatcgagaa 

gttttaatattcagaatcttttcttctcaagacgctgaatgtaatcttagttgtaaatacccatc 

acctgccagtcaccgagcactcatgcaccagggctttgcgttatgtcctaagatcctcataacca 

ccctgcaaggggactatcatcattacctctgtattacagatggagaaactgaggcacagagaggt 

aacgtgacttgtctcaggccataaagctggggaaagtagtggagctggttttgaacctgagctgt 

gagacctcagagccctaaactctggtgcctgtgtgttcccctttcaacccagactttggaaatca 

gtagacaccatatgcttcaaaaaacaggggctattaaaatgacatcaggagccagaaagtctcat 

ggctgtgctttctcttgaagtttatacaacaaccagatcaccgatgtcggagccaggtacgtcac 

caaaatcctggatgaatgcaaaggcctcacgcatcttaagtaagtggggtaggcaccaggttcct 

tagtatattctcttgat caccccc ttctgttgttcaaagattaaatgtcacagtaaagagctttc 

FIG. 18 (3 of 10) 

28 / 35 



BNSDOCID: <WO_. 9940102A1 I > 



WO 99/40102 



PCT/US99/02544 



atcctaaagccttccacttgtcccagggccatgttggtcaagtaaagatacctctgtgtgatctg 
tgaggcttggattctggaagggcctcccgttattggtagggggaaaggttggcattttgatttca 
ttaactactaggccgaagaaaggactaactctcaccctttctggtggtctttttgccccaaggga 
gtttcctgtcgggttgcaaggaagagcttgggcccttgccctgctgtaggtgtgccctgcgcagg 
gggtgacagtgcgccaggcttggagcctctggtcctgccctgacagtggccacataccttgaccc 
ttggcagtcaaagtgggacctcccaggtctcccgagggaagtcagtgatgctgctgaggtcaatt 
agaggaccccagggagggctcaggtccctgagcttctgcagagactgtggaccatctcctggaga 
ggaaccctgactgactgtcctcagggcttcagttccctccctgacaggaggcccaggccatggct 
cttgtggatcccagaagaaagtgtacggttcccaagatggggctggaaggggctctgtgctgggg 
aggagggtgacccacattggagcccctgcatagctggaggctgactgtgtgtgactctctctgca 
gactgggaaaaaacaaaataacaagtgaaggagggaagtatctcgccctggctgtgaagaacagc 
aaatcaatctctgaggttgggtgagtagaaggggatggatgtatgtggtacaacctgctgtgtgt 
gtggggggcgggccttgctgttcttttcatacatcagtacaccagaaggaccactggggctcgct 
gtcggggagagatagtggagagctttcaccatgctgcgaaactgaaaccgtgcccattaagcaat 
aactccccggtccccctcccccctgcctcttgcagccaccctgctacttactctctctatggttt 
tgactactctacctcatgtaagtggaatcatacagtatttgccttttggggatggctgatttcac 
tagcatcatgtcctcaagattcgtccacatggaagcatgggacaggatttccttttttttttttt 
ttttttttttttttgacagagtctcgctctgttgcccaggctggagtgcagtggcatgatctcgg 
ctcactgcaacctctgccttctgggttcaagcgattctctcgcctcagccacacgagtagctggg 
attataggcacccgccaccaatcccagctaatttttgtatttttagtagaggcggggtttcacca 
tgttggccaggctggtctcaaactcctgacctcaaatgatccacccacctcggtctcccaaagtg 
tcaggattataggcgtgagccaccgtgccccgccaggatttccttcttttttaaggctgagtaat 
act ccattgcatggctatgccacattttgtt tact cat t cat ccaagaacagacactggcttgct 
tctatgctttggctgttgtgaataatgctgctgtgcacatgggcatacaaatgtctcttcaagga 
ctgccttcaattcttttttttttttttttttttttttagattctttttttttttattatactcta 
agttttagggtacatgtgcacattgtgcaggttagttacatatgtatacatgtgccatgctggtg 
cgctgcacccactaatgtgtcatctagcattaggtatatctcccaatgctatccctcccccctcc 
cccgaccccaccacagtccccagagtgtgatattccccttcctgtgtccatgtgatctcattgtt 
caattcccacctatgagtgagaatatgcggtgtttggttttttgttcttgcgatagtttactgag 
aatgatggtttccaatttcatccatgtccctacaaaggatatgaactcatcattttttatggctg 
catagtattccatggtgtatatgtgccacattttcttaatccagtctatcattgttggacatttg 
ggttggttccaagtctttgctattgtgaatagtgccacaataaacatacgtgtgcatgtgtcttt 
atagcagcatgatttatactcatttgggtatatacccagtaatgggatggctgggtcaaatggta 
tttctagttctagatccctgaggaatcgccacactgacttccacaatggttgaactagtttacag 
tcccaccaacagtgtaaaagtgttcctatttctccgcatcctctccagcacctgctgtttcctga 
ctttttaatgattgccattctaactggtgtgagatgatatctcatagtggttttgatttgcattt 
ctctgatggccagtgatgatgagcatttcttcatgtgttttttggctgcataaatgtcttctttt 
gagaagtgtctgttcatgtccttcgcccactttttgatggggttgtttgtttttttcttgtaaat 
ttgtttgagttcattgtagattctggatattagccctttgtcagatgagtaggttgcgaaaattt 
tctcccatgttgtaggttgcctgttcactctgatggtagtttcttttgctgtgcagaagctcttt 
agtttaattagatcccatttgtcaattttgtcttttgttgccattgcttttggtgttttggacat 
gaagtccttgcccacgcctatgtcctgaatggtaatgcctaggttttcttctagggtttttatgg 
ttttaggtttaacgtttaaatctttaatccatcttgaattgatttttgtataaggtgtaaggaag 
ggatccagtttcagctttctacatatggctagccagttttcccagcaccatttattaaataggga 
atcctttccccattgcttgtttttctcaggtttgtcaaagatcagatagttgtagatatgcggca 
ttatttctgagggctctgttctgttccattgatctatatctctgttttggtaccagtaccatgct 
gttttggttactgtagccttgtagtatagtttgaagtcaggtagtgtgatgcctccagctttgtt 
cttttggcttaggattgacttggcgatgcgggctcttttttggttccatatgaactttaaagtag 
ttttttccaattctgtgaagaaagtcattggtagcttgatggggatggcattgaatctgtaaatt 
accttgggcagtatggccattttcacgatattgattcttcctacccatgagcatggaatgttctt 
ccatttgtttgtgtcctcttttatttccttgagcagtggtttgtagttctccttgaagaggtcct 
tcacatcccttgtaagttggattcctaggtattttattctctttgaagcaattgtgaatgggagt 
tcacccatgatttggctctctgtttgtctgttgttggtgtataagaatgcttgtgatttttgtac 
attgattttgtatcctgagactttgctgaagttgcttatcagcttaaggagattttgggctgaga 
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cgatggggttttctagataaacaatcatgtcgtctgcaaacagggacaatttgacttcctctttt 

cctaattgaataccctttatttccttctcctgcctgattgccctggccagaacttccaacactat 

gttgaataggagcggtgagagagggcatccctgtcttgtgccagttttcaaagggaatgcttcca 

gtttttgcccattcagtatgatattggctgtgggtttgtcatagatagctcttattattttgaaa 

tacgtcccatcaatacctaatttattgagagtttttagcatgaagggttgttgaattttgtcaaa 

ggctttttctgcatctattgagataatcatgtggtttttgtctttggctctgtttatatgctgga 

ttacatttattgatttgcgtatattgaaccagccttgcatcccagggatgaagcccacttgatca 

^ggtggataagctttttgatgtgctgctggattcggtttgccagtattttattgaggatttttgc 

atcaatgttcatcaaggatattggtctaaaattctcttttttggttgtgtctctgcccggctttg 

gtatcagaatgatgctggcctcataaaatgagttagggaggattccctctttttctattgattgg 

aatagtttcagaaggaatggtaccagttcctccttgtacctctggtagaattcggctgtgaatcc 

atctggtcctggactctttttggttggtaaactattgattattgccacaatttcagagcctgtta 

ttggtctattcagagattcaacttcttcctggtttagtcttgggagagtgtatgtgtcgaggaat 

gtatccatttcttctagattttctagtttatttgcgtagaggtgtttgtagtattctctgatggt 

agtttgtatttctgtgggatcggtggtgatatcccctttatcattttttattgtgtctatttgat 

tcttctctctttttttctttattagtcttgctagcggtctatcaattttgttgatcctttcgaaa 

aaccagctcctggattcattgattttttgaagggttttttgtgtctctatttccttcagttctgc 

tctgattttagttatttcttgccttctgctagcttttgaatgtgtttgctcttgcttttctagtt 

cttttaattgtgatgttagggtgtcaattttggatctttcctgctttctcttgtaggcatttagt 

gctataaatttccctctacacactgctttgaatgcgtcccagagattctggtatgtggtgtcttt 

gttctcgttggtttcaaagaacatctttatttctgccttcatttcgttatgtacccagtagtcat 

tcaggagcaggttgttcagtttccatgtagttgagcggctttgagtgagattcttaatcctgagt 

tctagtttgattgcactgtggtctgagagatagtttgttataatttctgttcttttacatttgct 

gaggagagctttacttccaactatgtggtcaattttggaataggtgtggtgtggtgctgaaaaaa 

atgtatattctgttgatttggggtggagagttctgtagatgtctattaggtctgcttggtgcaga 

gctgagttcaattcctgggtatccttgttgactttctgtctcattgatctgtctaatgttgacag 

tggggtgttaaagtctcccattattaatgtgtgggagtctaagtctctttgtaggtcactgagga 

cttgctttatgaatctgggtgctcctgtattgggtgcataaatatttaggatagttagctcctct 

tgttgaattgatccctttaccattatgtaatggccttctttgtctcttttgatctttgttggttt 

aaagtctgttttatcagagactaggattgcaacccctgcctttttttgttttccattggcttggt 

agatcttcctccatccttttattttgagcctatgtgtgtctctgcacgtgagatgggtttcctga 

atacagcacactgatgggtcttgactctttatccaacttgccagtctgtgtcttttaattgcaga 

atttagtccatttatatttaaagttaatattgttatgtgtgaatttgatcctgtcattatgatgt 

tagctggcgattttgctcattagttgatgcagtttcttcctagtctcgatggtctttacattttg 

gcatgattttgcagcggctggtaccggttgttcctttccatgtttaccgcttccttcaggagctc 

ttttagggcaggcctggtggtgacaaaatctctcagcatttgcttgtctataaagtattttattt 

ctccttcacttatgaagcttagtttggctggatatgaaattctgggttgaaaattcttttcttta 

agaatgttgaatattggcccccactctcttctggcttgtagggtttctgccgagagatccgctgt 

tagtctgatgggctttcctttgagggtaacccgaactttctctctggctgcccttaacatttttt: 

ccttcatttcaactttggtgaatctgacaattatgtgtcttggagttgctcttctcgaggagtat 

ctttgtggcgttctctgtatttcctgaatctgaacgttggcctgccttgctagattggggaagtt 

ctcctggataatatcctgcagagtgttttccaacttggttccattctccacatcactttcaggta 

caccaatcagacgtagatttggtcttttcacatagtcccatatttcttggaggctttgctcattt 

ctttttattcttttttctctaaacttcccttctcgcttcatttcattcatttcatcttccattgc 

tgataccctttcttccagttgatcgcatcggctcctgaggcttctgcattcttcacgtagttctc 

gagccttggttttcagctccatcagctcctttaagcacttctctgtattggttattctagttata 

cattcttctaaatttttttcaaagttttcaacttctttgcctttggtttgaatgtcctcccgtag 

ctcagagtaatttgatcgtctgaagccttcttctctcagctcgtcaaaatcattctccatccagc 

tttgttctgttgctggtgaggaactgcgttcctttggaggaggagaggcgctctgcgttttagag 

tttccagtttttctgttctgttttttccccatctttgtggttttatctacttttggtctttgatg 

atggtgatgtacagatgggttttcagtgtagatgtcctttctggttgttagttttccttctaaca 

gacaggaccctcagctgcaggtctgttggaataccctgccgtgtgaggtgtcagtgtgcctctgc 

tggggggtgcctcccagttaggctgctcgggggtcaggggtcagggacccacttgaggaggcagt 

ctgcccgttctcagatctccagctgcgtgctgggagaaccactgctctcttcaaagctgtcagac 
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agggacacttaagtctgcagaggttactgctgtctttttgtttgtctgtgccctgcccccagagg 

tggagcctacagaggcaggcaggcctccttgagctgtggtgggctccacccagttcgagcttccc 

ggctgctttgtttacctaagcaagcctgggctatggcgggcgcccctcccccagcctcgttgccg 

ccttgcagtttgatctcagactgctgtgctagcaatcagcgagattccgtgggcgtaggaccctc 

tgagccaggtgtgggatatagtctcgtggtgcgccgtttcttaagccggtctgaaaagcgcaata 

ttcgggtgggagtgacccgattttccaggtgcgtccgtcacccctttctttgactcggaaaggga 

actccctgatcccttgcgcttcccaggtgaggcaatgcctcgccctgcttcggctcgcgcacggt 

gcgcgcacacactggcctgcgcccactgtctggcgctccctagtgagatgaacccggtacctcag 

atggaaatgcagaaatcacccgtcttctgcgtcgctcacgctgggagctgtagaccggagctgtt 

cctattcggccatcttggctcctccctccaattcttttgggtatatatccagcagtgggattgct 

ggatcacatggtaatttttaattttttgaagaatcatcatactgttttccacggcagcagcacca 

ttttatgttcccaccaacagttcattctagtttctccacatccttgccaacacttgctattttct 

ctttttgacagtacccatcctaatgagtgtgaggtcctgtctcattgtggttttgattcttgagg 

ctttttaaagctttttgtttcattataatttttattggattacaaaaggaacacaggtaatttta 

tttggaaactatgaaaaataataaaaattatcttctcagaaaatgattcttgttaacatttaagc 

tcagttaagctctctcactttctctcccttctctctctttgtacaacttttaaaaaatatagtag 

gggtgagactatatgtatctatactatagtaggggtgagactatatgtatccttcctttttcact 

taatctcatgccttgagtagctttccactttattaaaaatgtgatgccattcaattgtatagtaa 

atacatatatgtaagcaaaacactgaaaactcttattctgggttccagcaagccatacctggaat 

ggtgtaagcaggtagtttgcttggtgtgaacgtgttgttgaggcagctgccattgtgttgtgagt 

gggccacacgaacttgttctgttgtgtgtagacagtgtgtgctgatcctattaggaacagccaac 

gctttgtgtgagccacacacggttctaagtgctttgcttctgttaactcagtgaatcctcacaac 

tccatgacggaatgctctaattatccccattttatagatggggcaactgaggtccaagagactac 

ataatttcccgaagttcacacaggtagcagatggcagagccgggtcaggagtccaccatcttacc 

acgcagactgttttagccagagactctccggatctgctgtaggggacagaatacagctttatcgc 

cgcacctgtccaccaagatggccgtagccacagagcttggttgggtaacgtcctctttatgtgac 

aggaacgttgctgatggggtttctgaaggtacttcctgctctttgtctcctggaagactgtgtct 

tcaggaatgtctctgaccctgcccagagttgaacggatgctgggaacccagcacctgcacacggc 

cttccctccaggactctgcgcacctctgtgctccacaggagacatgcaggtgctttctctcatga 

gctcaggctcctgggctgacagctctccgaagctcgtggtgaggctcggtctctaactgtgccac 

ttgccgatggcctctgttcacaaggcttcccctgctcttcgatcttgcatcaccccttgaatttg 

aaatccagagcagcccactcagagaccagtgtgaggaattagtgtccaggccacagatccaggga 

ctgggcacaaacatctgcctgttgagtaggaactgagctgtggccattggcaaaaaaggaggggt 

gagcatggctgtttcttggggagctaacattcactatcttgtctcctccctcaggatgtggggca 

atcaagttggggatgaaggagcaaaagccttcgcagaggctctgcggaaccaccccagcttgacc 

accctgaggtaactgtggccctgctgtctccaggggccaacctggtccctcccagctgctctagg 

tttgctggggaagggtgattcgtgctcctaatagaagaggaatttgcatgtgtgattttccttac 

tcttgtcaaacctttctttgatgcataagaggccatctagtaaagcacattcttctcttttttta 

actttaagttctgggatacatgtagaagatgtgcaggtttgttacataggcaaatgcatgccatg 

gtgatttgctgcacctatcaacctgtcatctaggttttaagccctgcatgcattaggtatttgtc 

ctaatgcttgccctccccttgccccccacccccaacaggccctggtgtgtgttgttcccctccat 

gtgtccatgtgttctcattgttcaactcccacttacgagtgagaacatgcagtgtttggtttttt 

gttcctgtgttagtttgctgagaatgatggtttccagcttcatccatgtgccagcaaaggacatg 

atctcatttttttttatggttgcatagtattccatagtgtgtatgtgccacattttctttatcca 

gtctatcactgatgggcatttgggttggttccaagtctttgctattgtaaatagtgctacaataa 

acatacatgtgcttgtgtctttatagcagaatgatttataatcctttgggtaaatacccagtaat 

gggattgctgggtcaaatggtatttctggttctagatccctgaggaatcaccttaagtgtttatt 

cagctcagtgaattctgcatgtgtcccacaccagccaaccaccacccccatcaagacagaggaca 

tttccagcccctcagccatccctgcatgtcccttgctggtagagggagggtttcctaagtgcaga 

tgaaacttaataagatgctggccagcagattcctgccccttccttgtcctcaggatgatgctgga 

aaagagggactcttcctctctataaatggggatgcacctacccagcccccgcttaggctgctggc 

caaatcttgggaccttggtatgtccacggctctgctgctgttcttcctaccactgaaaaagagtc 

caagaaggtggggacagtagcagaagagactttgccaggtcttgcagatggggtaccttgatggg 

gccagcctttagaaggacagcttgccaggcctcgccagcctcctgcccatgtgcagaaacctgag 
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gtgccgaccccagcccactgttgtqtgaqcaggctgtgctgatgacccatttcccgtccagcctg 
cccttgtgctctgtgtgtgggctctggggcagcagcgcctgggcactactgctgcagctgaacac 
ttctgcatcctgccccgagtgagcctgggctggggccacagccaggcagaggcttcccagctgtt 
ctgatgttgaagctaagattgaatgtagatgtgtctttaataattcaccccaagtgtgttccttc 
ctagtcttgcgtccaacggcatctccacagaaggaggaaagagccttgcgagggccctgcagcag 
aacacgt etc tagaaa tact gtggtaatagctcgagtcatttcatttgtt tgt ttgtttttctgt 
gatagggtcttgctttgtcgtccaggcttgagtgcattggtgtgatctcagctcactgcagcctc 
cacctcccaggctcattcgaacctcccgccttggccttccgagtcctgagactataggcatgcac 
caccacacccagttaattttaaaattttttgtagagatggggttttgctatgttacccaggctgg 
tcttgaactcctgggctcaagcagttctcctgccctggcttctcaaagccctgggattgcaggtg 
tgagccactgcacctggcacagagtcattttggagggtttaggtcccaggaattatcccaggggc 
tgcacatggcctggaatcttaacagaaaaggtgtctcccaattggaaaggctctaggcctttcag 
ttaagttgataatttcctcctagagaagagaatagccacttctacaagcataaacaggtacagga 
ggaggaagtgggctccgggagcctggatctgaggccttggccttctaggccccaggagaactaga 
acgctggccatgcaagctatccaggtatccttggataccttcagatgtgcttagcagaggccaac 
ttccacacacttggctcaaaattttctcccttcctcctcttcatctgccttcccccaggcagcct 
cctccttccccaggtcttcacatcagggtttggcctttatgctccatccagctcatctgtcactt 
gtcacctgaagcccacagtcctcgctccctctctgcactctagggcacttactaagtggatgtgg 
cctcctgagagtgttttttgttggtgttcccttttttatggccacttaatgttttattttgcttt 
atttgtatttacatctctgtatcataaattccatacaggtggctgggagcagtgactcacatctg 
taatcccagtactttggaaggctgaggtgggaggatcgcttgaggccaagagttcgagactagcc 
tgggcaatatagcgagaccctctatctacaaaaaaaaaaaacattccttacaggttaagtgaggg 
agttgtattacaaccctccctatcatctactcagagcccagtgctcatttgatcttgctaaatta 
gttactgagaataatgacaatatcctcttcatgagagagttttgacattaggcctgctgtccagt 
aagtgcattttaaattctttcccctcaacaaatcatttaacattttgaaaagtagtttatgtttt 
ttggaaaaaatgtaagacactaaaggaggacatgaaagtacctcctaaagttcctgctaaaagga 
ggaagtgaaagtacctccctttgtgttttccaaaataacctttcctttctagccttttgttctat 
gtatgttcaaagatatgcaaaacagaatagcattcaagcagtggctctaaaaatattgtaatcac 
atactttacatgtctcctttagggtttctccatcttgatgctgttgacattttggtccaagtgat 
tctttattatggtagggctgtcctgtgcatcatagacggtttagccgcatctctgccctgtacct 
cccagtggtgaggatcaaaaatgcctccggacatggccaggtgccccatggagagtgaaatcaca 
tggatagtagtaatgtcaacacctagaagccctcaagtgctgactgcatgccatgtgttattcta 
cactttttccctgtgttaactcactcagcctcacaaccactctatacgatctctactgttaacgt 
tcaccagtgagaaaactcagacccaaagaacttaagcctgttgcccgaggtcaccctgctggtgg 
gtgatacaaacctgcccaggctgagtccggagtagatgtcaatgctgtgttcttctccctcctca 
ttctacctcattctccctacaagctgcacaacatctcgaatagatatcacaatatatttcatcag 
ttgtttctgatctaaatttgttcagattttacattaggataataccacaatgcatgctgcaatgt 
ataaagctttgtgtgtatatccttgcacactgtagggtaaatttctagaagtctgattgtcttaa 
aatgaagcacattaaaaatttgggcaggcacatccaaactgcccttcaaggaattttttttttta 
aatgttctttctgttctattcttcttcctaatgattctttcgtccactggcacaagtgggtccta 
ccctgtttacaccaaggagctttggtgctttatccagaccacttctggttctaaggaccattgag 
agacttcctgaactttcagtcacttaacttgggtccctcacaagttaactgagagcaaagtactg 
aacacattttaatgtgcagtcagtgactgtttcaggtcttcaaactaacttggataacacactgt 
cagtggtgttcaagggaccctgggactagaggagaactgagaagcaggcattggccctttgtttt 
ccgtgggcccccatcttccatgaaatctgagggctcagcaaaggtggggagggagggtgggctcc 
tctacaggtagctgggctaagaaataggagcccaggtacaggatttgcattaaaaatgagtccca 
ttgaccttctgtggggctgacaggctgggcttggagcctggctgttttctgggttctcagcaagt 
gatcatctgcatagctggagagccttgggctgagctcccgctcctgtgaactctaaaacaatgtc 
tgccaagtaggctctcttgagtaaatacttccttttttttccttaggctgacccaaaatgaactc 
aacgatgaagtggcagagagtttggcagaaatgttgaaagtcaaccagacgttaaagcatttatg 
gtaactcagagagccttacaatttcagactgtgctacttttcaaaagtattttttgagataaaat 
ttacatactgtaaaattcactctcttaaagtatacaattcagaggtttttagtgcaaccatcacc 
acctaattctagaacattttcactcctcctccccactccaaaaagccctggtatccattaagcag 
tcactccctgtcctcctccccagaccctggcaaccactaatccgctttctgtctctatggatttg 
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cctactctgggcatttcatataaatggaatcaagcaatatgtgaccttttgtctctgtgttctag 

catgtttcattcctttttatggctaaatgataattcactctaaggaaattttgcagtttattaat 

cagttgatgggacatttgggttgtttctactttttgactattatgcgtaatgctactgtgaacac 

tcctgttcatgcttttgggtgaacatatgttttcatctcttttgggaatatacctgggaatagaa 

tttctgggtcatatggcaattctgtaactttttgaggagccaccaaactgttttctaaagtggat 

gtactattttacattctcgccagcaatgtatgtggattccaatttctccacatcctcaccaacac 

ttattattgtccatcttttaaaatctagttatactagtggatgtgaagtaatattgtggttttga 

tttgcatttccctgatgacaacaatgttgaatgtctttttatgtgcctactgggagtctgtatag 

cttctttggagaaatgtctccatatcctttgcccattttaaaattgggtttgtcttctaatgctg 

agttataggggttctctatatattctgggtgctagacctttactagatacaggttttgcaagtat 

tttctttctttctgtggagtttttcctctttcttgatagtgacctttaaaggacaacagttttta 

atttttgttttttttgagatggagtcttgctcttgtcacccagacaggagtgcagtggcatgatc 

tcagctctctgcaacctccacctcctgggttcaagcgattcttctgcctcagcctcctgagtagt 

tgggattacaggcatcagccaccatgcctgtctcattttgtatttttaatagagatggggtttca 

ccatttaggcccaggctggtcttgaactcctgacctcaggtgatccacctgcctcagcctcccaa 

agtgctgggattacaggcgaaaagccactgcacctggccaatagtttttaattttgatgaagtcc 

aatttatctatttttttctttggttgcttgtgctttcagtgtcttatctaagaaatgattgccta 

atccaagatcacaaagaactccacctaagttttctgttaagcgttatagttgtttcccctcacat 

ataggtctgcaatccattttgagttaatttttgtatagtgtaaagtgagggttaacctcattctc 

ttgcacgtggatatccagctgtcccggcagcaccacgtgttgaacagattatcttttcctattga 

atggccttgacacccttgtcaaaaatcaattgaccataaatgtatgggtttatttctgaattctc 

tgttctggtccattgatttatatgtctctcctatgccaggaccattgctgtagctttgtgtagta 

cattttgaaatcaggaggtgtgagttctactttgttcttctttctcaagattgtttagaccattc 

tgggttctttgcatttcttatgaattcagactcaccttgtcaatttctgcaaaaagactagactc 

tgctacatattgtttttt.ctttcctttttagcctgcagaattatttgatcccattccctaagtgc 

aggccagcctctccagggagagcagagctaggacagggtcagaaagagagtcttggctgctttgt 

gcattcccaacctgcactggccctagtgaaggcagcccgagtgggtggatgtgcctggacactgc 

aggctttttaggggcattaggtgctctccttcctggcctcctgccacatcttggttggaggctgc 

cttccctgccttcaaaaaagcctaagtggtgactagaaaacagcagagtgtaactgaatacagaa 

cttggtgcccacttcctggttctatttttgtcccttttgaaagggaaggtcattacctctgccat 

tgaacccaggggccctagcccttgtggggtatggctgggagcaccagatcctggctgcagcccag 

ccaccagtggtcctgtgtgcttgggcagtaacagtgacaagagctcccttccccctggacactgt 

gcctaataccctcctcttgaaatctcacacacccagtggatggggggcactcttatagttattct 

cagtttacagatgacacaactgaggcacagacagatgcgtttatttcttcaaggttctgtagctg 

aacagtggggagggagggtttaagaggagctgcacccgctctgcaatactgcctctcacgaggga 

gtcctcttcattcatgacagcatagggccctcgtcttcctggtaagggcttccttcttgggtcag 

tgccaggatttctaagggtcatgtttagcaggagcctattctacaaacagccaggagcagggaat 

gactctgtgatgaagcggagacactacagcctcttgatgcatttatttcctggttgggttagaag 

cgtagctgcccaagggagcatttcaggagaggcctggcttcctagcgatagctgaaaactttgtt 

tcatttgaatcactgctacccagaacaatggggtgcattctcagagtccccattattaaagcttt 

tccactgagccccatgagaactattcatgagaactatttcatggcagcataactgtttctcctcc 

ctccctcttgcatgttggtagcctcttaactttaaaacctgccttgcctttccctagctacctgg 

aaggagacgtcagacttcctgtcccatggtgtgtttcttacaatttgttgttcagattggtggtc 

tcccaaatatatataaaaatataaatggagtctcactctgtcacccaggctggagtgcagtggca 

cgatcttggctcactgcaacctccacctcccagttcaagcaattctcctacctcagtctcccgag 

tagctgggagtacaggtgcacaccaccatgcccagctaatttttttgtatttttaatagagacag 

gttggccaggatggtatcgatctcctgagctcgtgatccacccacctcggcctcccaaagtgctg 

ggattacaggtgtgagccactgcacccggccccaaatattttgattatgcacctctgcagtgaaa 

aatgcaaacacacacatcagttcatgtattacattatgttcactataaaaacaaacagaaaattt 

aaaaaatatcaagctatcctttactctagtggatcttacctggacacttttagccagatacaaag 

tcacatggactcagttcttcccctgaccaacttgtctcttatcccaaaacacccttgcaactccc 

ttacgaaggggtcaaatttgatccagtattatggattttatacaagttatgttcttctttcaggc 

ttatccagaatcagatcacagctaaggggactgcccagctggcagatgcgttacagagcaacact 

ggcataacagagatttggtaagatcccagcgtttgtcacagtaataacaccagtgactgtttact 

FIG. 18 (8 of 10) 



33 / 35 



WO 99/40102 



PCT/US99/02544 



caccaccactgactgtgcaaggcacaacgcagggtggtttctgtttattcctccagcaaccctgc 
acagtaatggtattacctctgttttacagaggtagacagaggcccagaccagtgaaataaggttg 
cccaaggtcactacgagagaagctagaattcagcccagaatgcctgattccatattctgtgctct 
cctgccctgggcccccgccctcatctaccttcattgggtgggatgggggaagtggccagtgaaat 
gatttcctagtggaagtaaatccccctgggactcagcaattgagagatgactgtgttggccagga 
gtttggagctcattcttccccttttctgggttccgtaagacatttccaggctgacttgaactgac 
ctgtgctctttgtctacttcttttttctgctttgagaacttccttatgctaatagaagaaaaaaa 
gtttgctttactgtgacattgagcgccatgccacttctttcttgcctcccataaggcacagacac 
tccccactcagcagctcccttaacaacttaattgcctgggtgacgtgggactgggtggatgctgg 
gagaggggccttattaactatgtcctcctttcatgactggggagaatttcatagccaattaaaaa 
aaaacaaaaaacagctccttggccaacacaggctcctcatacagtgttttttaaactttgcttta 
gaacttgtttggaacttgtcataaaatcgatcagtttggtgaattgcaaccaacaatatttaaaa 
agaaaacagaacagaacaaaatatcaggatgcaatgtgcatggtatgaaagtatcatttcattca 
tcttagttcatgcttgcatgtgagtgggtgtgtgtttgcataagtgttggttcacaacataaaat 
gtaattcttatttagggttgtagacaaaaggtttttttttaaaaaaaacactgttggctaggcat 
ggtggctcatgcctgtaataccagcactttgggaggccaagatgggcagatctcttgagcacagg 
agtttgagaccagcctgggcaacatgcgaaaccccgtcactacaaaaattagcccgacatggtgc 
tatgtgcctgtagtcccagctactcaggaagctgatgtgggaggatggatgcatgggagatcaag 
gctgcagtgagccaggatcatgccactgcaatccagcctgggtgccagagaccctgtctcaaaaa 
acaaaaaagaaaaaaagaaaaacaccatcatagagaatagagcccagatctaaacagacacctgt 
ggcctgtgtgcctgcgaagcccagcctgcccagcagcctgggaagcactggagggcactggaact 
gtttgcatgggtgtttgccctcaggccactccgtttctgctgattcttaagttttgaggacagca 
ggcagagggggagaggaaggagactgccagactacagaacagtttgcagagcacagttggct tec 
acttttctctgtagctggtcaggcgggtagtaaagacctacagttgctttaattctgtcaagttt 
caaaatctgeat tgcttccctcttgagggtcaccattcctacacaaggaaccattt tagtagggc 
caggagacttcagcttcaaggcctgcacttgtgtcagggtggagaggggaactggccaccaattc 
agagagggcaggacaggcggcatgggtgctggtcttgggagtgtcttcacttaggtccctggctt 
gttctgggagcctccagagcatgctcctctgtgtgtgacttcatgggactgggctctgagaaggc 
tgtggctttgttggccctgccagggactgccacaccaggccacagggttgtggttgagctggccg 
gggagccacgttcagggagcagctctgcttggagccaacacttacagagtaagccttctccttgg 
acttgttaactgtactgacacttatttctacctcattcctttctgaaaataacttggaagtctga 
agtcccttgatgagttctgtctttaagaacagaaattagaggtgaacaatgaacactgtaaatta 
cagaaatgtatcccactccagtataacagctttctgtgaggctatctcctccagactgtggctct 
gggagggtggggcctgagtcaaggtcctagggactagtgctgtgtcttcatttattccttgaata 
acgaaacgcttgagcatcagggactgtgctagcaccaaaaatccagtggtgaacaacatggcttc 
atgggttcactgtctagaaagggagaagcacattaaagaaaaaatcatttgcgtaattatttaat 
tacaactgtgatgggtactatcacaaaggggaaggccaagagggaacctgatttagatgaggttg 
cagggaaggcctctctgaggaagcagcacttacactaagccatgaaggatgaataggagctagtc 
agctgaggtgagtattctgcgtagggaacagcatgtgcaaagggtctggggcaggagggagtgtg 
gtgtcctggaagaactgccagaagctgctgtgccccagggttcagacagtgtggaagaggggact 
acaggaggctgaggagataggcagggactggaccataaaagatctgtgggtcatgatgtgcattt 
tggtctttatcctaaaagtgatggaaagtcagtgaacagtttgaagcaggagaggcatgtgatca 
gatctgcaatgcaaaaagaccaattcttggctcttctaggaaactgaattggagaaggccagagt 
acgtggaaatgacctgtcagtaggacattgtactgatgcagggaagagatgatgggtgctcagac 
caagatggccggccaaagacatagaggttccagggaggcattctagattcttaggaattagggga 
gaactttgtgatacaaggaacatggggatgagaaggaaggtgtccaggttgacccccaggttact 
aacctgctcagcaggatgagagtggtccattcactaagccaggggaccctaggaggtgtggctac 
tttgaggtgtgggggagaggtccaagtgaggatgccaagcaggtaactgcctccacggacataca 
aacaaggccgtggcattgatgagatcgggtggggaaaagggcttagccccaaacctggaggaaat 
ctcagatgtagaggtcacatggaggagaatataggaaaggaaattgaagtagagtgctcagatgc 
a 99 a g aaaaa tcagcgcatataaccaagccaaggggagggagtgcctcaagaaggagggagagga 
gaggtcaggacagccaaaatcctgagggccaagaaagacaagacctggaaaatgtcattaaattc 
aggcttatggaggctacaggtgaccttagtgagacccagtgaacagagggatggcagctggagag 
gatccatgctaatatgaaggaactatctgcaaagggtatgttccttaatttcagggatacatgtg 
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tattgtgtgatacacgagtgtgtgctatgaacacaccttgggaaggagtgtgcgaggatccttaa 
cattttacctgtgtacttttgtcttcctccttttcaacagcctaaatggaaacctgataaaacca 
gaggaggccaaagtctatgaagatgagaagcggattatctgtttctgagaggatgctttcctgtt 
catggggtttttgccctggagcctcagcagcaaatgccactctgggcagtcttttgtgtcagtgt 
cttaaaggggcctgcgcaggcgggactatcaggagtccactgcctccatgatgcaagccagcttc 
ctgtgcagaaggtctggtcggcaaactccctaagtacccgctacaattctgcagaaaaagaatgt 
gtcttgcgagctgttgtagttacagtaaatacactgtgaagagactttattgcctattataa 
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NOVEL MOLECULES OF THE CARD - RELATED PROTEIN 
FAMILY AND USES THEREOF 

5 Cross Reference to Related Applications 

This application is a continuation-in-part of U.S. 
Application Serial No. 09/207,359 filed December 8, 1998, 
which is a continuation-in-part of U.S. Application 
Serial No. 09/099,041, filed June 17, 1998, which is a 
10 continuation-in-part of U.S. Application Serial No. 

09/019,942, filed February 6, .1998. The contents of each 
of these applications is incorporated herein by this 
reference . 



Background of the Invention 

15 In multicellular organisms, homeostasis is 

maintained by balancing the rate of cell proliferation 
against the rate of cell death. Cell proliferation is 
influenced by numerous growth factors and the expression 
of proto- oncogenes, which typically encourage progression 

2 0 through the cell cycle. In contrast, numerous events, 
including the expression of tumor suppressor genes, can 
lead to an arrest of cellular proliferation. 

In differentiated cells, a particular type of cell 
death called apoptosis occurs when an internal suicide 

25 program is activated. This program can be initiated by a 
variety of external signals as well as signals that are 
generated within the cell in response to, for example, 
genetic damage. For many years, the magnitude of 
apoptotic cell death was not appreciated because the 

30 dying cells are quickly eliminated by phagocytes, without 
an inflammatory response. 

The mechanisms that mediate apoptosis have been 
intensively studied. These mechanisms involve the 
activation of endogenous proteases, loss of mitochondrial 
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function, and structural changes such as disruption of 
the cytoskeleton, cell shrinkage, membrane blebbing, and 
nuclear condensation due to degradation of DNA. The 
various signals that trigger apoptosis are thought to 
5 bring about these events by converging on a common cell 
death pathway that is regulated by the expression of 
genes that are highly conserved from worms, such as C. 
elegans, to humans. In fact, invertebrate model systems 
have been invaluable tools in identifying and 

10 characterizing the genes that control apoptosis. Through 
the study of invertebrates and more evolved animals, 
numerous genes that are associated with cell death have 
been identified, but the way in which their products 
interact to execute the apoptotic program is poorly 

15 understood. 

Caspases, a class of proteins central to the 
apoptotic program, are cysteine protease having 
specificity for aspartate at the substrate cleavage site. 
These proteases are primarily responsible for the 

20 degradation of cellular proteins that lead to the 

morphological changes seen in cells undergoing apoptosis. 
For example, one of the caspases identified in humans was 
previously known as the interleukin- la (IL-la) converting 
enzyme (ICE), a cysteine protease responsible for the 

25 processing of pro- IL- la to the active cytokine. 

Overexpression of ICE in Rat-1 fibroblasts induces 
apoptosis (Miura et al . , Cell 75:653, 1993). 

Many caspases and proteins that interact with 
caspases possess domains of about 60 amino acids called a 

30 caspase recruitment domain (CARD). Hofmann et al . (TIBS 
22:155, 1997) and others have postulated that certain 
apoptotic proteins bind to each other via their CARDs and 
that different subtypes of CARDs may confer binding 
specificity, regulating the activity of various caspases, 

3 5 for example. 
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The functional significance of CARDs have been 
demonstrated in recent publications. Duan et al . (Nature 
385:86, 1997) showed that deleting the CARD at the 
N-terminus of RAIDD , a newly identified protein involved 
5 in apoptosis, abolished the ability of RAIDD to bind to 
caspases. In addition, Li et al . (Cell 91:479, 1997) 
showed that the N- terminal 97 amino acids of apoptotic 
protease activating factor- 1 (Apaf-1) was sufficient to 
confer caspase-9-binding ability. Inohara et al . (J. 

10 Biol. Chem. 273:12296-12300, 1998) showed that Apaf-1 can 
bind several other caspases such as caspase-4 and 
caspase-8. Apaf-1 can interact with caspases via 
CARD-CARD interaction (Li et al . , supra, Hu et al . , PNAS, 
95:4386-4391, 1998) . 

15 Nuclear factor- kB (NF-kB) is a transcription 

factor expressed in many cell types and which activates 
homologous or heterologous genes that have kB sites in 
their promoters. Quiescent NF-kB resides in the 
cytoplasm as a heterodimer between proteins referred to 

2 0 as p50 and p65 and is complexed with the regulatory 

protein IkB. NF-kB binding to IkB causes NF-kB to remain 
in the cytoplasm. At least two dozen stimuli that 
activate NF-kB are known (New England Journal of Medicine 
336:1066, 1997) and they include cytokines, protein 

2 5 kinase C activators, oxidants, viruses, and immune system 
stimuli. NF-kB activating stimuli activate specific IkB 
kinases that phosphorylate IkB leading to its 
degradation. Once liberated from IkB, NF/cB translocates 
to the nucleus and activates genes with kB sites in their 

30 promoters. How all of these NF- kB activating stimuli act 
is unknown at the present time and it is presumed that 
novel NF- kB pathway components are involved. NF - kB and 
the NF-kB pathway has been implicated in mediating 
chronic inflammation in inflammatory diseases such as 

35 asthma, ulcerative colitis, rheumatoid arthritis (New 
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England Journal of Medicine 336:1066, 1997) and 
inhibiting NF-kB or NF-kB pathways may be an effective 
way of treating these diseases. NF-kB and the NF-*B 
pathway has also been implicated in atherosclerosis 
5 (American Journal of Cardiology 76:18c, 1995), especially 
in mediating fatty streak formation, and inhibiting NF-kB 
or NF-/cB pathways may be an effective therapy for 
atherosclerosis . 



Summary of the Invention 

10 The present invention is based, at least in part, 

on the discovery of genes encoding CARD-3 and CARD-4 . 
The CARD-4 gene can express a long transcript that 
encodes CARD-4L., a short transcript that encodes partial 
CARD - 4 S , or two CARD-4 splice variants. A murine full 

15 length cDNA sequence for the murine ortholog of CARD - 4 L 
is also presented. CARD-3 and CARD-4 are intracellular 
proteins that are predicted to be involved in regulating 
caspase activation. CARD-4 is found to activate the 
NF-kB pathway and to enhance caspase 9 -mediated cell 

20 death. In addition, proteins that bind to CARD-4 are 
presented including CARD-3 and hNUDC. 

The CARD-3 cDNA described below (SEQ ID NO:l) has 
a 1620 open reading frame (nucleotides 214 to 1833 of SEQ 
ID NO:l; SEQ ID NO : 3 ) which encodes a 54 0 amino acid 

25 protein (SEQ ID NO : 2 ) . CARD-3 contains a kinase domain 
which extends from amino acid 1 to amino acid 3 00 of SEQ 
ID NO: 2; SEQ ID NO : 4 , followed by a linker domain at 
amino acid 301 to amino acid 431 of SEQ ID NO: 2; SEQ ID 
NO: 5 and a CARD at amino acid 432 to amino acid 54 0 of 

3 0 SEQ ID NO:2; SEQ ID NO : 6 . 

At least four forms of CARD-4 exist in the cell, a 
long form, CARD - 4 L , a short form, CARD - 4 S , and two splice 
variants, CARD-4Y and CARD - 4 Z . The cDNA of CARD - 4 L 
described below (SEQ ID NO: 7) has a 2859 nucleotide open 
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reading frame (nucleotides 245-3103 of SEQ ID NO : 7 ; SEQ 
ID NO: 9) which encodes a 953 amino acid protein (SEQ ID 
NO: 8) . CARD-4L protein possesses a CARD domain (amino 
acids 15-114; SEQ ID NO:10). The nucleotide sequence of 
5 the full length cDNA corresponding to the murine ortholog 
of human CARD-4L is presented (SEQ ID NO: 42) as is the 
predicted amino acid sequence of murine CARD-4L (SEQ ID 
NO: 43) . A comparison between the predicted amino acid 
sequences of human CARD - 4 L and murine CARD-4L is also 

10 depicted in Figure 17. 

Human CARD - 4 L is also predicted to have a 
nucleotide binding domain which extends from about amino 
acid 198 to about amino acid 397 of SEQ ID NO: 8; SEQ ID 
NO: 11, a Walker Box "A", which extends from about amino 

15 acid 2 02 to about amino acid 2 09 of SEQ ID NO : 8 ; SEQ ID 
NO: 12, a Walker Box "B", which extends from about amino 
acid 2 80 to about amino acid 2 84, of SEQ ID NO: 8; SEQ ID 
NO: 13, a kinase la (P-loop) subdomain, which extends from 
about amino acid 12 7 to about amino acid 212 of SEQ ID 

20 NO: 8; SEQ ID NO: 46, a kinase 2 subdomain, which extends 
from about amino acid 273 to about amino acid 288 of SEQ 
ID NO: 8; SEQ ID NO: 47, a kinase 3a subdomain, which 
extends from about amino acid 327 to about amino acid 338 
of SEQ ID NO: 8; SEQ ID NO: 14, and ten Leucine -rich 

2 5 repeats which extend from about amino acid 674 to about 

amino acid 950 of SEQ ID NO : 8 . The first Leucine-rich 
repeat extends from about amino acid 674 to about amino 
acid 7 01 of SEQ ID NO : 8 ; SEQ ID NO: 15. The second 
Leucine-rich repeat extends from about amino acid 702 to 

3 0 about amino acid 72 7 of SEQ ID NO: 8; SEQ ID NO: 16. The 

third Leucine-rich repeat extends from about amino acid 
728 to about amino acid 754 of SEQ ID NO: 8; SEQ ID NO: 17. 
The fourth Leucine-rich repeat extends from about amino 
acid 755 to about amino acid 782 of SEQ ID NO: 8; SEQ ID 
35 NO: 18. The fifth Leucine-rich repeat extends from about 
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amino acid 783 to about amino acid 810 of SEQ ID NO : 8 ; 
SEQ ID NO: 19. The sixth Leucine-rich repeat extends from 
about amino acid 811 to about amino acid 838 of SEQ ID 
NO: 8; SEQ ID NO : 2 0 . The seventh Leucine-rich repeat 
5 extends from about amino acid 83 9 to about amino acid 866 
of SEQ ID NO: 8; SEQ ID NO : 2 1 . The eighth Leucine-rich 
repeat extends from about amino acid 8 67 to about amino 
acid 8 94 of SEQ ID NO : 8 ; SEQ ID NO: 22. The ninth 
Leucine-rich repeat extends from about amino acid 895 to 

10 about amino acid 922 of SEQ ID NO : 8 ; SEQ ID NO: 23 and the 
tenth leucine-rich repeat extends from about amino acid 
923 to about amino acid 950 of SEQ ID NO : 8 ; SEQ ID NO: 24. 

The partial cDNA of CARD-4S described below (SEQ 
ID NO: 25) has a 1470 nucleotide open reading frame 

15 (nucleotides 1-1470 of SEQ ID NO:25; SEQ ID NO:27) which 
encodes a 490 amino acid protein (SEQ ID NO: 26) . CARD-4S 
protein possesses a CARD domain (amino acids 1-74 of SEQ 
ID NO: 26; SEQ ID NO: 28) . CARD- 4 S is predicted to have a 
P-Loop which extends from about amino acid 163 to about 

20 amino acid 170 of SEQ ID NO: 26; SEQ ID NO: 29, and a 

Walker Box "B" which extends form about amino acid 241 to 
about amino acid 245 of SEQ ID NO: 26; SEQ ID NO: 30. 

A human CARD- 4 Y nucleotide cDNA sequence is 
presented (SEQ ID NO: 38) as is the amino acid sequence of 

2 5 the predicted CARD-4 Y product (SEQ ID NO: 39) . A human 
CARD-4Z nucleotide cDNA sequence is presented (SEQ ID 
NO: 40) as is the amino acid sequence of the predicted 
CARD- 4 Z product (SEQ ID NO: 41) . A comparison of the 
CARD-4Y, CARD - 4 Z , and human CARD-4L predicted amino acid 

30 sequences is also shown in Figure 14. 

Like other proteins containing a CARD domain, both 
CARD-3 and CARD-4 are expected to participate in the 
network of interactions that lead to caspase activity. 
Human CARD-4L is expected to play a functional role in 

35 caspase activation similar to that of Apaf-1 (Zou et al . , 
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Cell, 90:405-413, 1997). For example, upon activation, 
CARD - 4 L might bind a nucleotide, thus allowing CARD-4L to 
bind and activate a CARD- containing caspase via a 
CARD-CARD interaction, leading to apoptotic death of the 
5 cell. Accordingly, CARD-3 and CARD-4 molecules are 
useful as modulating agents in regulating a variety of 
cellular processes including cell growth and cell death. 
In one aspect, this invention provides isolated nucleic 
acid molecules encoding CARD-3 or CARD-4 proteins or 

10 biologically active portions thereof, as well as nucleic 
acid fragments suitable as primers or hybridization 
probes for the detection of CARD-3 or CARD-4 encoding 
nucleic acids. 

The invention encompasses methods of diagnosing 

15 and treating patients who are suffering from a disorder 
associated with an abnormal level or rate (undesirably 
high or undesirably low) of apoptotic cell death, 
abnormal activity of the Fas/APO-1 receptor complex, 
abnormal activity of the TNF receptor complex, or 

20 abnormal activity of a caspase by administering a 
compound that 'modulates the expression of CARD-3 or 
CARD-4 (at the DNA, mRNA or protein level, e.g., by 
altering mRNA splicing) or by altering the activity of 
CARD-3 or CARD-4. Examples of such compounds include 

25 small molecules, antisense nucleic acid molecules, 
ribozymes, and polypeptides. 

Certain disorders are associated with an increased 
number of surviving cells, which are produced and 
continue to survive or proliferate when apoptosis is 

30 inhibited. These disorders include cancer (particularly 
follicular lymphomas, carcinomas associated with 
mutations in p53, and hormone -dependent tumors such as 
breast cancer, prostate cancer, and ovarian cancer) , 
autoimmune disorders (such as systemic lupus 

35 erythematosis, immune -mediated glomerulonephritis) , and 
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viral infections (such as those caused by herpesviruses, 
poxviruses, and adenoviruses). 

Failure to remove autoimmune cells that arise 
during development or that develop as a result of somatic 
5 mutation during an immune response can result in 

autoimmune disease. One of the molecules that plays a 
critical role in regulating cell death in lymphocytes is 
the cell surface receptor for Fas. 

Populations of cells are often depleted in the 

10 event of viral infection, with perhaps the most dramatic 
example being the cell depletion caused by the human 
immunodeficiency virus (HIV) . Surprisingly, most T cells 
that die during HIV infections do not appear to be 
infected with HIV. Although a number of explanations 

15 have been proposed, recent evidence suggests that 

stimulation of the CD4 receptor results in the enhanced 
susceptibility of uninfected T cells to undergo 
apoptosis . 

A wide variety of neurological diseases are 
20 characterized by the gradual loss of specific sets of 
neurons. Such disorders include Alzheimer's disease, 
Parkinson's disease, amyotrophic lateral sclerosis (ALS) 
retinitis pigmentosa, spinal muscular atrophy, and 
various forms of cerebellar degeneration. The cell loss 
25 in these diseases does not induce an inflammatory 

response, and apoptosis appears to be the mechanism of 
cell death. 

In addition, a number of hematologic diseases are 
associated with a decreased production of blood cells. 

30 These disorders include anemia associated with chronic 
disease, aplastic anemia, chronic neutropenia, and the 
myelodysplastic syndromes. Disorders of blood cell 
production, such as myelodysplastic syndrome and some 
forms of aplastic anemia, are associated with increased 

35 apoptotic cell death within the bone marrow. These 
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disorders could result from the activation of genes that 
promote apoptosis, acquired deficiencies in stromal cells 
or hematopoietic survival factors, or the direct effects 
of toxins and mediators of immune responses . 
5 Two common disorders associated with cell death 

are myocardial infarctions and stroke. In both 
disorders, cells within the central area of ischemia, 
which is produced in the event of acute loss of blood 
flow, appear to die rapidly as a result of necrosis. 

10 However, outside the central ischemic zone, cells die 
over a more protracted time period and morphologically 
appear to die by apoptosis. 

The invention features a nucleic acid molecule 
which is at least 45% (or 55%, 65%, 75%, 85%, 95%, or 

15 98%) identical to the nucleotide sequence shown in SEQ ID 
NO:l, SEQ ID NO : 3 , SEQ ID NO : 7 , SEQ ID NO : 9 , SEQ ID: 25, 
SEQ ID NO: 27, SEQ ID NO: 38, SEQ ID NO: 40, and SEQ ID 
NO: 42, the nucleotide sequence of the cDNA insert of the 
plasmid deposited with ATCC as Accession Number (the 

20 "cDNA of ATCC " ) , the nucleotide sequence of the 

cDNA insert of the plasmid deposited with ATCC as 

Accession Number (the " cDNA of ATCC " ) , the 

nucleotide sequence of the cDNA insert of the plasmid 
deposited with ATCC as Accession Number (the " cDNA of 

2 5 ATCC " )/ or a complement thereof. 

The invention features a nucleic acid molecule 
which includes a fragment of at least 150 (300, 325, 350, 
375, 400, 425, 450, 500, 550, 600, 650, 700, 800, 900, 
1000, 1300, 1600 or 1931) nucleotides of the nucleotide 

3 0 sequence shown in SEQ ID NO:l, or SEQ ID NO: 3, or the 

nucleotide sequence of the cDNA ATCC , or a 

complement thereof . 

The invention also features a nucleic acid 
molecule which includes a fragment of at least 150 (350, 
35 400, 450, 500, 550, 600, 650, 700, 800, 900, 1000, 1300, 
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1600, 1900, 2100, 2400, 2700, 3000, or 3382) nucleotides 
of the nucleotide sequence shown in SEQ ID NO : 7 , SEQ ID 
NO: 9, SEQ ID NO: 43 or the nucleotide sequence of the 

cDNA ATCC , or a complement thereof. 

5 Also within the invention is a nucleic acid 

molecule which includes a fragment of at least 150 (350, 
400, 450, 500, 550, 600, 650, 700, 800, 900, 1000, 1300, 
1600, 1900, 2100, 2400, 2700, and 3080) nucleotides of 
the nucleotide sequence shown in SEQ ID NO: 25, SEQ ID 
10 NO:27, SEQ ID NO : 3 8 , SEQ ID NO:40, or the nucleotide 

sequence of the cDNA ATCC , or a complement 

thereof . 

The invention features a nucleic acid molecule 
which includes a nucleotide sequence encoding a protein 

15 having an amino acid sequence that is at least 45% (or 
55%, 65%, 75%, 85%, 95%, or 98%) identical to the amino 
acid sequence of SEQ ID NO : 2 , SEQ ID NO : 8 , SEQ ID NO: 26, 
SEQ ID NO: 39, SEQ ID NO : 4 1 , and SEQ ID NO: 43, or the 
amino acid sequence encoded by the cDNA of ATCC , the 

2 0 amino acid sequence encoded by the cDNA of ATCC , or 

the amino acid sequence encoded by the cDNA of ATCC . 



In an embodiment, a CARD- 3 nucleic acid molecule 
has the nucleotide sequence shown in SEQ ID NO:l, or SEQ 
ID NO: 3, or the nucleotide sequence of the cDNA of ATCC 

25 . In another embodiment, a CARD-4L nucleic acid 

molecule has the nucleotide sequence shown in SEQ ID 
NO: 7, or SEQ ID NO: 9, or the nucleotide sequence of the 

cDNA of ATCC . In yet another embodiment, a CARD-4S 

nucleic acid molecule has the nucleotide sequence shown 

30 in SEQ ID NO: 25, or SEQ ID NO: 27, or the nucleotide 

sequence of the cDNA of ATCC . In another 

embodiment, a murine CARD- 4 L nucleic acid molecule has 
the nucleotide sequence shown in SEQ ID NO: 42. In 
another embodiment, a CARD-4Y nucleic acid molecule has 
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the nucleotide sequence shown in SEQ ID NO: 38 or the 

nucleotide sequence of the cDNA of ATCC . In another 

embodiment, a CARD - 4 Z nucleic acid molecule has the 
nucleotide sequence shown in SEQ ID NO: 40 or the 

5 nucleotide sequence of the cDNA of ATCC . 

Also within the invention is a nucleic acid 
molecule which encodes a fragment of a polypeptide having 
the amino acid sequence of SEQ ID NO : 2 or SEQ ID NO : 8 or 
SEQ ID NO: 26 or SEQ ID NO: 39 or SEQ ID NO: 41 or SEQ ID 

10 NO: 43, the fragment including at least 15 (25, 30, 50, 

100, 150, 300, 400 or 540, 600, 700, 800, 953). contiguous 
amino acids of SEQ ID NO : 2 or SEQ ID NO : 8 or SEQ ID NO: 26 
or SEQ ID NO: 39 or SEQ ID NO: 41 or SEQ ID NO: 43 or the 
polypeptide encoded by the cDNA of ATCC Accession Number _ 

15 , or the polypeptide encoded by the cDNA of ATCC 

Accession Number or the polypeptide encoded by the 

cDNA of ATCC 

Accession Number . 

The invention includes a nucleic acid molecule 

20 which encodes a naturally occurring allelic variant of a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2 or SEQ ID NO : 3 9 or SEQ ID NO: 41 or SEQ ID NO: 43 or 
an amino acid sequence encoded by the cDNA of ATCC 
Accession Number , wherein the nucleic acid molecule 

25 hybridizes to a nucleic acid molecule comprising SEQ ID 
NO:l or SEQ ID NO : 3 or SEQ ID NO: 38 or SEQ ID NO: 40 or 
SEQ ID NO: 42 under stringent conditions. The invention 
also includes a nucleic acid molecule which encodes a 
naturally occurring allelic variant of a polypeptide 

3 0 comprising the amino acid sequence of SEQ ID NO: 8 or an 
amino acid sequence encoded by the cDNA of ATCC Accession 

Number , wherein the nucleic acid molecule 

hybridizes to a nucleic acid molecule comprising SEQ ID 
NO: 7 or SEQ ID NO : 9 under stringent conditions. 
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The invention also includes a nucleic acid 
molecule which encodes a naturally occurring allelic 
variant of a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 26 or an amino acid sequence 

5 encoded by the cDNA of ATCC Accession Number , 

wherein the nucleic acid molecule hybridizes to a nucleic 
acid molecule comprising SEQ ID NO: 25 or SEQ ID NO: 27 
under stringent conditions. In general, an allelic 
variant of a gene will be readily identifiable as mapping 

10 to the same chromosomal location as said gene. For 

example, in Example 6, the chromosomal location of the 
human CARD-4 gene is discovered to be chromosome 7 close 
to the SHGC-31928 genetic marker. Allelic variants of 
human CARD-4 will be readily identifiable as mapping to 

15 the human CARD-4 locus on chromosome 7 near genetic 
marker SHGC-31928. 

Also within the invention are: an isolated CARD-3 
protein having an amino acid sequence that is at least 
about 65%, preferably 75%, 85%, 95%, or 98% identical to 

2 0 the amino acid sequence of SEQ ID NO: 2; an isolated 

CARD- 3 protein having an amino acid sequence that is at 
least about 85%, 95%, or 98% identical to the kinase 
domain of SEQ ID NO: 2 (e.g., about amino acid residues 1 
to 300 of SEQ ID NO:2; SEQ ID NO : 4 ) ; and an isolated 

2 5 CARD- 3 protein having an amino acid sequence that is at 
least about 85%, 95%, or 98% identical to the linker 
domain of SEQ ID NO : 2 (e.g., about amino acid residues 
301 to 431 of SEQ ID NO : 2 ; SEQ ID NO : 5 ) ; an isolated 
CARD- 3 protein having an amino acid sequence that is at 

30 least about 85%, 95%, or 98% identical to the CARD domain 
of SEQ ID NO:2 (e.g., about amino acid residues 432 to 
540 of SEQ ID NO:2; SEQ ID NO:6); an isolated CARD- 4 L 
protein having an amino acid sequence that is at least 
about 65%, preferably 75%, 85%, 95%, or 98% identical to 

35 the amino acid sequence of SEQ ID NO: 8; an isolated 
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CARD-4L protein having an amino acid sequence that is at 
least about 85%, 95%, or 98% identical to the CARD domain 
of SEQ ID NO:8 (e.g., about amino acid residues 15 to 114 
of SEQ ID NO:8; SEQ ID NO:10); an isolated CARD - 4 L 
5 protein having an amino acid sequence that is at least 
about 85%, 95%, or 98% identical to the nucleotide 
binding domain of SEQ ID NO: 8 (e.g., about amino acid 
residues 198 to 3 97 of SEQ ID NO: 8; SEQ ID NO: 11; an 
isolated CARD-4L protein having an amino acid sequence 

10 that is at least about 85%, 95%, or 98% identical to the 
kinase la (P-loop) subdomain SEQ ID NO:8 (e.g., about 
amino acid 127 to about amino acid 212 of SEQ ID NO : 8 ; 
SEQ ID NO:46); an isolated CARD-4L protein having an 
amino acid sequence that is at least about 85%, 95%, or 

15 98% identical to the kinase 2 subdomain of SEQ ID NO: 8 
(e.g., about amino acid 273 to about amino acid 288 of 
SEQ ID NO:8; SEQ ID NO:47); an isolated CARD-4L protein 
having an amino acid sequence that is at least about 85%, 
95%, or 98% identical to a kinase 3a subdomain of SEQ ID 

20 NO: 8 (e.g., about amino acid residues 327 to 338 of SEQ 
ID NO: 8; SEQ ID NO: 14); an isolated CARD-4L protein 
having an amino acid sequence that is at least about 85%, 
95%, or 98% identical to the Leucine-rich repeats of SEQ 
ID NO:8 (e.g., about amino acid residues 674 to 701 of 

2 5 SEQ ID NO: 8; SEQ ID NO: 15; from amino acid 702 to amino 

acid 72 7 of SEQ ID NO: 8; SEQ ID NO: 16; which extends from 
amino acid 728 to amino acid 754 SEQ ID NO: 8; SEQ ID 
NO: 17; from amino acid 755 to amino acid 782 of SEQ ID 
NO: 8; SEQ ID NO: 18; from amino acid 783 to amino acid 810 

3 0 of SEQ ID NO: 8; SEQ ID NO: 19; from amino acid 811 to 

amino acid 838 of SEQ ID NO: 8; SEQ ID NO: 20 from amino 
acid 83 9 to amino acid 866 of SEQ ID NO : 8 ; SEQ ID NO: 21; 
from amino acid 867 to amino acid 894 of SEQ ID NO: 8; SEQ 
ID NO: 22; from amino acid 895 to amino acid 922 of SEQ ID 
35 NO: 8; SEQ ID NO:23; and from amino acid 923 to amino acid 
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950 of SEQ ID NO:8; SEQ ID NO:24); an isolated CARD- 4 S 
protein having an amino acid sequence that is at least 
about 65%, preferably 75%, 85%, 95%, or 98% identical to 
the amino acid sequence of SEQ ID NO: 26; an isolated 
5 CARD - 4 S protein having an amino acid sequence that is at 
least about 85%, 95%, or 98% identical to the CARD domain 
of SEQ ID NO:26 (e.g., about amino acid residues 1 to 74 
of SEQ ID NO:26; SEQ ID NO:28) . Also within the invention 
are: an isolated murine CARD-4L protein having an amino 

10 acid sequence that is at least about 65%, preferably 75%, 
85%, 95%, or 98% identical to the amino acid sequence of 
SEQ ID NO: 43. Also within the invention are: an isolated 
CARD-4Y protein having an amino acid sequence that is at 
least about 65%, preferably 75%, 85%, 95%, or 98% 

15 identical to the amino acid sequence of SEQ ID NO: 39. 
Also within the invention are: an isolated CARD-4Z 
protein having an amino acid sequence that is at least 
about 65%, preferably 75%, 85%, 95%, or 98% identical to 
the amino acid sequence of SEQ ID NO:41. 

20 Also within the invention are: an isolated CARD-3 
protein which is encoded by a nucleic acid molecule 
having a nucleotide sequence that is at least about 65%, 
preferably 75%, 85%, or 95% identical to SEQ ID NO : 3 or 
the cDNA of ATCC ; an isolated CARD-3 protein which 

25 is encoded by a nucleic acid molecule having a nucleotide 
sequence at least about 65% preferably 75%, 85%, or 95% 
identical to the kinase domain encoding portion of SEQ ID 
NO:l (e.g., about nucleotides 213 to 1113 of SEQ ID 
NO:l); an isolated CARD-3 protein which is encoded by a 

3 0 nucleic acid molecule having a nucleotide sequence at 

least about 65% preferably 75%, 85%, or 95% identical the 
linker domain encoding portion of SEQ ID NO : 1 (e.g., 
about nucleotides 1114 to 1506 of SEQ ID NO:l); and an 
isolated CARD-3 protein which is encoded by a nucleic 

35 acid molecule having a nucleotide sequence at least about 
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65% preferably 75%, 85%, or 95% identical the CARD domain 
encoding portion of SEQ ID NO:l (e.g., about nucleotides 
1507 to 1833 of SEQ ID NO : 1 ) ; and an isolated CARD-3 
protein which is encoded by a nucleic acid molecule 
5 having a nucleotide sequence which hybridizes under 
stringent hybridization conditions to a nucleic acid 
molecule having the nucleotide sequence of SEQ ID NO: 3 or 

the non-coding strand of the cDNA of ATCC . Also 

within the invention are: an isolated CARD-4Y protein 

10 which is encoded by a nucleic acid molecule having a 
nucleotide sequence that is at least about 65%, 
preferably 75%, 85%, or 95% identical to SEQ ID NO: 38 or 

the cDNA of ATCC . Also within the invention are 

nucleic acid molecules which include about nucleotides 

15 2759 to 2842 of SEQ ID NO : 7 ; about nucleotides 2843 to 
2926 of SEQ ID NO : 7 ; about nucleotides 2927 to 3010 of 
SEQ ID NO: 7; about nucleotides 3 011 to 3 0 94 of SEQ ID 
NO: 7; and an isolated CARD - 4 L protein which is encoded by 
a nucleic acid molecule having a nucleotide sequence 

20 which hybridizes under stringent hybridization conditions 
to a nucleic acid molecule having the nucleotide sequence 
of SEQ ID NO: 9 or the non-coding strand of the cDNA of 
ATCC . 

Also within the invention are: an isolated CARD-4S 
25 protein which is encoded by a nucleic acid molecule 

having a nucleotide sequence that is at least about 65%, 
preferably 75%, 85%, or 95% identical to SEQ ID NO: 27 or 

the cDNA of ATCC ; an isolated CARD-3 protein which 

is encoded by a nucleic acid molecule having a nucleotide 
30 sequence at least about 65% preferably 75%, 85%, or 95% 
identical the CARD domain encoding portion of SEQ ID 
NO:25 (e.g., about nucleotides 1 to 222 of SEQ ID NO:25); 
an isolated CARD-3 protein which is encoded by a nucleic 
acid molecule having a nucleotide sequence at least about 
35 65% preferably 75%, 85%, or 95% identical the P-Loop 
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encoding portion of SEQ ID NO:25 (e.g.,. about nucleotides 
485 to 510 of SEQ ID NO:25) . 

Also within the invention is a polypeptide which 
is a naturally occurring allelic variant of a polypeptide 
5 that includes the amino acid sequence of SEQ ID NO : 2 or 
an amino acid sequence encoded by the cDNA insert of the 

plasmid deposited with ATCC as Accession Number , 

wherein the polypeptide is encoded by a nucleic acid 
molecule which hybridizes to a nucleic acid molecule 
10 comprising SEQ ID NO : 1 or SEQ ID NO : 3 under stringent 
condit ions . 

Also within the invention is a polypeptide which 
is a naturally occurring allelic variant of a polypeptide 
that includes the amino acid sequence of SEQ ID NO : 8 or 
15 an amino acid sequence encoded by the cDNA insert of the 

plasmid deposited with ATCC as Accession Number 

wherein the polypeptide is encoded by a nucleic acid 
molecule which hybridizes to a nucleic acid molecule 
comprising SEQ ID NO: 7 or SEQ ID NO : 9 under stringent 

2 0 conditions. 

Also within the invention is a polypeptide which 
is a naturally occurring allelic variant of a polypeptide 
that includes the amino acid sequence of SEQ ID NO: 26 or 
an amino acid sequence encoded by the cDNA insert of the 

25 plasmid deposited with ATCC as Accession Number , 

wherein the polypeptide is encoded by a nucleic acid 
molecule which hybridizes to a nucleic acid molecule 
comprising SEQ ID NO: 25 or SEQ ID NO: 27 under stringent 
conditions . 

3 0 Another embodiment of the invention features 

CARD-3 or CARD-4 nucleic acid molecules which 
specifically detect CARD-3 or CARD-4 nucleic acid 
molecules, relative to nucleic acid molecules encoding 
other members of the CARD superfamily. For example, in 
35 one embodiment, a CARD- 4 L nucleic acid molecule 
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hybridizes under stringent conditions to a nucleic acid 
molecule comprising the nucleotide sequence of SEQ ID 

NO: 7, SEQ ID NO: 9, or the cDNA of ATCC , or a 

complement thereof. In another embodiment, the CARD-4L 
5 nucleic acid molecule is at least 300 (350, 400, 450, 
500, 550, 600, 650, 700, 800, 900, 1000, 1300, 1600, 
1900, 2100, 2400, 2700, 3000, or 3382) nucleotides in 
length and hybridizes under stringent conditions to a 
nucleic acid molecule comprising the nucleotide sequence 

10 shown in SEQ ID NO: 7, SEQ ID NO: 9, the cDNA of ATCC 

, or a complement thereof. In another embodiment, 

an isolated CARD-4L nucleic acid molecule comprises 
nucleotides 287 to 586 of SEQ ID NO: 7, encoding the CARD 
domain of CARD-4L, or a complement thereof. In yet 

15 another embodiment, the invention provides an isolated 
nucleic acid molecule which is antisense to the coding 
strand of a CARD-4L nucleic acid. 

Another aspect of the invention provides a vector, 
e.g., a recombinant expression vector, comprising a 

20 CARD-3 or a CARD-4L nucleic acid molecule of the 

invention. In another embodiment the invention provides 
a host cell containing such a vector. The invention also 
provides a method for producing CARD-3 or CARD- 4 protein 
by culturing, in a suitable medium, a host cell of the 

2 5 invention containing a recombinant expression vector such 

that a CARD-3 or CARD-4 protein is produced. 

Another aspect of this invention features isolated 
or recombinant CARD-3 or CARD-4 proteins and 
polypeptides. Preferred CARD-3 or CARD-4 proteins and 

3 0 polypeptides possess at least one biological activity 

possessed by naturally occurring human CARD-3 or CARD-4, 
e.g., (1) the ability to 

form protein : protein interactions with proteins in the 
apoptotic signalling pathway; (2) the ability to form 
3 5 CARD-CARD interactions with proteins in the apoptotic 
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signaling pathway; (3) the ability to bind the CARD-3 or 
CARD-4 ligand; (4) and the ability to bind to an 
intracellular target. Other activities include: (1) 
modulation of cellular proliferation, (2) modulation of 
5 cellular differentiation and (3) modulation of cellular 
death (4) modulation of the NF-kB pathway. 

The CARD- 3 or CARD-4 proteins of the present 
invention, or biologically active portions thereof, can 
be operatively linked to a non-CARD-3 or non-CARD-4 

10 polypeptide (e.g., heterologous amino acid sequences) to 
form CARD-3 or CARD-4 fusion proteins, respectively. The 
invention further features antibodies that specifically 
bind CARD-3 or CARD-4 proteins, such as monoclonal or 
polyclonal antibodies. In addition, the CARD-3 or CARD-4 

15 proteins or biologically active portions thereof can be 
incorporated into pharmaceutical compositions, which 
optionally include pharmaceutically acceptable carriers. 

In another aspect, the present invention provides 
a method for detecting the presence of CARD-3 or CARD-4 
activity or expression in a biological sample by 
contacting the biological sample with an agent capable of 
detecting an indicator of CARD-3 or CARD-4 activity such 
that the presence of CARD-3 or CARD-4 activity is 
detected in the biological sample. 

In another aspect, the invention provides a method 
for modulating CARD-3 or CARD-4 activity comprising 
contacting a cell with an agent that modulates (inhibits 
or stimulates) CARD-3 or CARD-4 activity or expression 
such that CARD-3 or CARD-4 activity or expression in the 
cell is modulated. In one embodiment, the agent is an 
antibody that specifically binds to CARD-3 or CARD-4 
protein. In another embodiment, the agent modulates 
expression of CARD-3 or CARD-4 by modulating 
transcription of a CARD-3 or CARD-4 gene, splicing of a 



25 
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CARD-3 or CARD-4 mRNA, or translation of a CARD-3 or 
CARD- 4 mRNA. In yet another embodiment, the agent is a 
nucleic acid molecule having a nucleotide sequence that 
is antisense to the coding strand of the CARD-3 or CARD-4 
5 mRNA or the CARD-3 or CARD-4 gene. 

In one embodiment, the methods of the present 
invention are used to treat a subject having a disorder 
characterized by aberrant CARD-3 or CARD-4 protein or 
nucleic acid expression or activity by administering an 
10 agent which is a CARD-3 or CARD-4 modulator to the 
subject. In one embodiment, the CARD-3 or CARD-4 
modulator is a CARD-3 or CARD-4 protein. In another 
embodiment the CARD-3 or CARD-4 modulator is a CARD-3 or 
CARD-4 nucleic acid molecule. In other embodiments, the 
15 CARD-3 or CARD-4 modulator is a peptide, peptidomimetic , 
or other small molecule. 

The present invention also provides a diagnostic 
assay for identifying the presence or absence of a 
genetic lesion or mutation characterized by at least one 
20 of: (i) aberrant modification or mutation of a gene 

encoding a CARD-3 or CARD-4 protein; (ii) mis -regulation 
of a gene encoding a CARD-3 or CARD-4 protein; (iii) 
aberrant RNA splicing; and (iv) aberrant 
post-translational modification of a CARD-3 or CARD-4 
25 protein, wherein a wild-type form of the gene encodes a 
protein with a CARD-3 or CARD-4 activity. 

In another aspect, the invention provides a 
method for identifying a compound that binds to or 
modulates the activity of a CARD-3 or CARD-4 protein. In 
3 0 general, such methods entail measuring a biological 

activity of a CARD-3 or CARD-4 protein in the presence 
and absence of a test compound and identifying those 
compounds which alter the activity of the CARD-3 or 
CARD-4 protein. 
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The invention also features methods for 
identifying a compound which modulates the expression of 
CARD- 3 or CARD-4 by measuring the expression of CARD- 3 or 
CARD- 4 in the presence and absence of a compound. 
5 Other features and advantages of the invention 

will be apparent from the following detailed description 
and claims. 



Brief Description of the Drawings 
Figure 1 depicts the cDNA sequence (SEQ ID NO:l) 
10 of human CARD-3. The open reading frame of CARD-3 (SEQ 
ID NO:l) extends from nucleotide 213 to nucleotide 1833 
nucleotide (SEQ ID NO:3). 

Figure 2 depicts the predicted amino acid sequence 
(SEQ ID NO: 2) of human CARD-3. 
15 Figure 3 depicts the cDNA sequence (SEQ ID NO: 7) 

of CARD - 4 L . The open reading frame of SEQ ID NO : 7 
extends from nucleotide 245 to nucleotide 3103 (SEQ ID 
NO: 9) . 

Figure 4 depicts the predicted amino acid sequence 
2 0 (SEQ ID NO: 8) of human CARD-4L. 

Figure 5 depicts the partial cDNA sequence (SEQ ID 
NO: 25) of CARD-4S and the predicted amino acid sequence 
(SEQ ID NO:25) of human CARD-4S. The open reading frame 
of CARD-4 (SEQ ID NO: 25) extends from nucleotide 1 to 
25 nucleotide 1470 (SEQ ID NO:27). 

Figure 6 depicts the predicted amino acid sequence 
(SEQ ID NO: 26) of human CARD- 4 S . 

Figure 7 depicts an alignment of the CARD domains 
of CARD-4 (SEQ ID NO:10), CARD-3 (SEQ ID NO:6), ARC-CARD 
30 (SEQ ID NO:31), cIAPl-CARD (SEQ ID NO:32) and c I AP2 - CARD 
(SEQ ID NO: 33) . 
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Figure 8 is a plot showing predicted structural 
features of human CARD-4L. 

Figure 9 is a plot showing predicted structural 
features of human CARD - 4 S . 
5 Figure 10 depicts the cDNA sequence (SEQ ID NO: 38) 

of the human CARD-4Y splice variant clone. The predicted 
open reading frame of the human CARD-4Y splice variant 
clone extends from nucleotide 438 to nucleotide 1184. 

Figure 11 depicts the amino acid sequence (SEQ ID 
10 NO: 39) of the protein predicted to be encoded by the 
human CARD-4Y cDNA open reading frame. 

Figure 12 depicts the cDNA sequence (SEQ ID NO: 40) 
of the human CARD-4Z splice variant clone. The predicted 
open reading frame of the human CARD-4Z splice variant 
15 clone extends from nucleotide 489 to nucleotide 980. 

Figure 13 depicts the amino acid sequence (SEQ ID 
NO: 41) of the protein predicted to be encoded by the 
human CARD- 4 Z cDNA open reading frame. 

Figure 14 depicts an alignment of human CARD- 4 L 
20 (SEQ ID NO: 8), the predicted amino acid sequence of human 
CARD-4Y (SEQ ID NO:39), and the predicted amino acid 
sequence of human CARD- 4 Z (SEQ ID NO: 41) . 

Figure 15 depicts the nucleotide sequence of the 
murine CARD-4L cDNA (SEQ ID NO:42). 
25 Figure 16 depicts the predicted amino acid 

sequence of murine CARD-4L (SEQ ID NO: 43) . 

Figure 17 depicts an alignment of human CARD-4L 
(SEQ ID NO: 8) and the predicted amino acid sequence of 
murine CARD- 4 L (SEQ ID NO: 43) . 
30 Figure 18 depicts a 32042 nucleotide genomic 

sequence of CARD -4 . 



Detailed Description of the Invention 
The present invention is based, in part, on the 
discovery of cDNA molecules encoding human CARD- 3 , human 
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CARD-4 and partial murine CARD - 4 L proteins. A nucleotide 
sequence encoding a human CARD- 3 protein is shown in 
Figure 1 (SEQ ID NO : 1 ; SEQ ID NO : 3 includes the open 
reading frame only) . A predicted amino acid sequence of 
5 CARD-3 protein is also shown in Figure 2 (SEQ ID NO : 2 ) . 
CARD-4 has at least two forms, a long form, CARD - 4 L , and 
a short form, CARD - 4 S , as well as two or more splice 
variants. A nucleotide sequence encoding a human CARD-4L 
protein is shown in Figure 3 (SEQ ID NO : 7 ; SEQ ID NO : 9 

10 includes the open reading frame only) . A predicted amino 
acid sequence of CARD-4L protein is also shown in Figure 
4 (SEQ ID NO: 8) . A nucleotide sequence encoding a human 
CARD-4S protein is shown in Figure 5 (SEQ ID NO: 25; SEQ 
ID NO: 27 includes the open reading frame only) . A 

15 predicted amino acid sequence of CARD - 4 S protein is also 
shown in Figure 6 (SEQ ID NO:26). Two additional splice 
variants of human CARD-4 are provided in Figures 10 and 
11 (human CARD - 4 Y ) and Figures 12 and 13 (human CARD - 4 Z ) 
(predicted amino acid sequences: SEQ ID NO: 39 and SEQ ID 

2 0 NO: 41 and nucleic acid sequences: SEQ ID NO: 38 and SEQ ID 

NO:40). These two splice variants are predicted to 
contain 249 and 164 amino acids, respectively. An 
alignment of human CARD-4Y, human CARD-4Z and human 
CARD -4 L is shown in Figure 14. 
25 In addition to the human CARD-4 proteins, a full 

length nucleotide sequence of the murine ortholog of 
human CARD- 4 L is provided in Figure 15 (SEQ ID NO: 42) . An 
alignment of murine CARD- 4 L, with human CARD - 4 L is shown 
in Figure 17. 

30 The human CARD-3 cDNA of Figure 1 (SEQ ID NO:l), 

which is approximately 1931 nucleotides long including 
untranslated regions, encodes a protein amino acid having 
a molecular weight of approximately 61 kDa (excluding 
post- translational modifications) . A plasmid containing 

3 5 a cDNA encoding human CARD-3 (with the cDNA insert name 
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of) was deposited with American Type Culture Collection 

(ATCC) , Manasass, VA on and assigned Accession 

Number . This deposit will be maintained under 

the terms of the Budapest Treaty on the International 
5 Recognition of the Deposit of Microorganisms for the 
Purposes of Patent Procedure. This deposit was made 
merely as a convenience for those of skill in the art and 
is not an admission that a deposit is required under 3 5 
U.S.C. §112. 

10 The human CARD-4L cDNA of Figure 3 (SEQ ID NO:7), 

which is approximately 33 82 nucleotides long including 
untranslated regions, encodes a protein amino acid having 
a molecular weight of approximately 10 8 kDa (excluding 
post-translational modifications) . A plasmid containing 

15 a cDNA encoding human CARD-4L (with the cDNA insert name 

of ) was deposited with American Type Culture 

Collection (ATCC) , Manasass, VA on and assigned 

Accession Number . This deposit will be 

maintained under the terms of the Budapest Treaty on the 

20 International Recognition of the Deposit of 

Microorganisms for the Purposes of Patent Procedure. 
This deposit was made merely as a convenience for those 
of skill in the art and is not an admission that a 
deposit is required under 35 U.S.C. §112. 

25 The human CARD-4S cDNA of Figure 5 (SEQ ID NO:25), 

which is 3 082 nucleotides long including untranslated 
regions. A plasmid containing a cDNA encoding human 

CARD- 4 S (with the cDNA insert name of ) was 

deposited with American Type Culture Collection (ATCC) , 

3 0 Manasass, VA on and assigned Accession Number 

This deposit will be maintained under the terms of the 
Budapest Treaty on the International Recognition of the 
Deposit of Microorganisms for the Purposes of Patent 
Procedure. This deposit was made merely as a convenience 
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for those of skill in the art and is not an admission 
that a deposit is required under 35 U.S.C. §112. 

A region of human CARD- 4 L protein (SEQ ID NO : 8 ) 
bears some similarity to a CARD domain of CARD- 3 (SEQ ID 
5 NO:6), ARC-CARD (SEQ ID NO : 3 1 ) , cIAPl-CARD (SEQ ID 

NO:32), and CIAP2-CARD (SEQ ID NO:33). This comparison 
is depicted in Figure 7. 

Human CARD- 3 or CARD - 4 are members of a family of 
molecules (the "CARD family") having certain conserved 

10 structural and functional features. The term "family" 

when referring to the protein and nucleic acid molecules 
of the invention is intended to mean two or more proteins 
or nucleic acid molecules having a common structural 
domain and having sufficient amino acid or nucleotide 

15 sequence identity as defined herein. Such family members 
can be naturally occurring and can be from either the 
same or different species. For example, a family can 
contain a first protein of human origin and a homologue 
of that protein of murine origin, as well as a second, 

20 distinct protein of human origin and a murine homologue 
of that protein. Members of a family may also have 
common functional characteristics. 

In one embodiment, a CARD- 3 or CARD -4 protein 
includes a CARD domain having at least about 65%, 

25 preferably at least about 75%, and more preferably about 
85%, 95%, or 98% amino acid sequence identity to the CARD 
domain of SEQ ID NO: 6 or the CARD domain of SEQ ID NO: 10 
or the CARD domain of SEQ ID NO: 28. 

Preferred CARD- 3 or CARD-4 polypeptides of the 

3 0 present invention have an amino acid sequence 

sufficiently identical to the CARD domain consensus amino 
acid sequence of SEQ ID NO: 6, SEQ ID NO: 10, SEQ ID NO: 28, 
respectively. The CARD-3 polypeptide also has an amino 
acid sequence sufficiently identical to the kinase domain 

3 5 consensus sequence of SEQ ID NO: 4, and an amino acid 
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sequence that is sufficiently identical to the linker 
domain of SEQ ID NO: 5. The CARD-4L polypeptide has an 
amino acid sequence sufficiently identical to the 
nucleotide binding domain of SEQ ID NO: 11, an amino acid 
5 sequence sufficiently identical to the Walker Box "A" of 
SEQ ID NO: 12 or Walker Box "B" of SEQ ID NO: 13, an amino 
acid sequence sufficiently identical to the kinase la 
subdomain of SEQ ID NO: 46, an amino acid sequence 
sufficiently identical to the kinase 2 subdomain of SEQ 

10 ID NO: 47, or an amino acid sequence sufficiently 

identical to the kinase 3a subdomain of SEQ ID NO: 14, or 
an amino acid sequence sufficiently identical to the 
Leucine -rich repeats of SEQ ID NO: 14, SEQ ID NO: 15, SEQ 
ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ 

15 ID i?O:20, SEQ ID NO: 21, SEQ ID NO: 22, and SEQ ID NO: 23. 
As used herein, the term 11 sufficiently identical" refers 
to a first amino acid or nucleotide sequence which 
contains a sufficient or minimum number of identical or 
equivalent (e.g., an amino acid residue which has a 

20 similar side chain) amino acid residues or nucleotides to 
a second amino acid or nucleotide sequence such that the 
first and second amino acid or nucleotide sequences have 
a common structural domain and/or common functional 
activity- For example, amino acid or nucleotide 

2 5 sequences which contain a common structural domain having 
about 65% identity, preferably 75% identity, more 
preferably 85%, 95%, or 98% identity are defined herein 
as sufficiently identical. 

As used interchangeably herein a "CARD- 3 or CARD- 4 

30 activity", "biological activity of CARD-3 or CARD-4" or 
"functional activity of CARD-3 or CARD-4", refers to an 
activity exerted by a CARD-3 or CARD-4 protein, 
polypeptide or nucleic acid molecule on a CARD-3 or 
CARD-4 responsive cell as determined in vivo, or in 

35 vitro, according to standard techniques. A CARD-3 or 
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CARD-4 activity can be a direct activity, such as an 
association with or an enzymatic activity on a second 
protein or an indirect activity, such as a cellular 
signaling activity mediated by interaction of the CARD-3 
5 or CARD-4 protein with a second protein. In an 

embodiment, a CARD-3 or CARD-4 activity includes at least 
one or more of the following activities: (i) interaction 
with proteins in the apoptotic signalling pathway (ii) 
interaction with a CARD- 3 or CARD-4 ligand; or (iii) 

10 interaction with an intracellular target protein; (iv) 
indirect interaction with caspases . For example, in 
Example 4, CARD-3 -containing proteins were shown to 
associate with CARD-4 -containing proteins. In example 9, 
CARD-4 proteins were shown to induce NF~eB-mediated 

15 transcription. In example 10, CARD-3 and CARD-4 were 
shown to enhance caspase 9 activity. 

Accordingly, another embodiment of the invention 
features isolated CARD-3 or CARD-4 proteins and 
polypeptides having a CARD-3 or CARD-4 activity. 

20 Various aspects of the invention are described in 

further detail in the following subsections. 

I. Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated 
nucleic acid molecules that encode CARD-3 or CARD-4 

25 proteins or biologically active portions thereof, as well 
as nucleic acid molecules sufficient for use as 
hybridization probes to identify CARD-3 or 
CARD-4 -encoding nucleic acids (e.g., CARD-3 or CARD-4 
mRNA) and fragments for use as PCR primers for the 

30 amplification or mutation of CARD-3 or CARD-4 nucleic 
acid molecules. As used herein, the term "nucleic acid 
molecule" is intended to include DNA molecules (e.g., 
cDNA or genomic DNA) and RNA molecules (e.g., mRNA) and 
analogs of the DNA or RNA generated using nucleotide 
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analogs. The nucleic acid molecule can be 
single-stranded or double -stranded, but preferably is 
double -stranded DNA. 

An "isolated" nucleic acid molecule is one which 
5 is separated from other nucleic acid molecules which are 
present in the natural source of the nucleic acid. 
Preferably, an "isolated" nucleic acid is free of 
sequences (preferably protein encoding sequences) which 
naturally flank the nucleic acid (i.e., sequences located 

10 at the 5' and 3' ends of the nucleic acid) in the genomic 
DNA of the organism from which the nucleic acid is 
derived. For example, in various embodiments, the 
isolated CARD-3 or CARD-4L./S nucleic acid molecule can 
contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 

15 kb or 0.1 kb of nucleotide sequences which naturally 
flank the nucleic acid molecule in genomic DNA of the 
cell from which the nucleic acid is derived. Moreover, 
an "isolated" nucleic acid molecule, such as a cDNA 
molecule, can be substantially free of other cellular 

2 0 material, or culture medium when produced by recombinant 
techniques, or substantially free of chemical precursors 
or other chemicals when chemically synthesized. 

A nucleic acid molecule of the present invention, 
e.g., a nucleic acid molecule having the nucleotide 

2 5 sequence of SEQ ID NO:l, SEQ ID NO: 3, SEQ ID NO: 7, SEQ ID 

NO: 9, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 38, SEQ ID 

NO: 40, SEQ ID NO: 42, the cDNA of ATCC , the cDNA of 

ATCC , or a complement of any of these nucleotide 

sequences, can be isolated using standard molecular 

3 0 biology techniques and the sequence information provided 

herein. Using all or portion of the nucleic acid 
sequences of SEQ ID NO:l, SEQ ID NO: 3, SEQ ID NO: 7, SEQ 
ID NO: 9, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 38, SEQ ID 

NO: 40, SEQ ID NO: 42, the cDNA of ATCC or the cDNA 

35 of ATCC , as a hybridization probe, CARD-3 or CARD -4 
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nucleic acid molecules can be isolated using standard 
hybridization and cloning techniques (e.g., as described 
in Sambrook et al . , eds. , Molecular Cloning: A 
Laboratory Manual. 2nd, ed . , Cold Spring Harbor 
5 Laboratory, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY, 1989) . 

A nucleic acid of the invention can be amplified 
using cDNA, mRNA or genomic DNA as a template and 
appropriate oligonucleotide primers according to standard 

10 PCR amplification techniques. The nucleic acid so 

amplified can be cloned into an appropriate vector and 
characterized by DNA sequence analysis. Furthermore, 
oligonucleotides corresponding to CARD- 3 or CARD-4 
nucleotide sequences can be prepared by standard 

15 synthetic techniques, e.g., using an automated DNA 
synthesizer. 

In another embodiment, an isolated nucleic acid 
molecule of the invention comprises a nucleic acid 
molecule which is a complement of the nucleotide sequence 

2 0 shown in SEQ ID NO:l, SEQ ID NO : 3 , SEQ ID NO : 7 , SEQ ID 
NO: 9, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 38, SEQ ID 

NO: 40, SEQ ID NO: 42, the cDNA of ATCC or the cDNA 

of ATCC or a portion thereof. A nucleic acid 

molecule which is complementary to a given nucleotide 

25 sequence is one which is sufficiently complementary to 
the given nucleotide sequence that it can hybridize to 
the given nucleotide sequence thereby forming a stable 
duplex . 

Moreover, the nucleic acid molecule of the 
30 invention can comprise only a portion of a nucleic acid 
sequence encoding CARD- 3 or CARD-4, for example, a 
fragment which can be used as a probe or primer or a 
fragment encoding a biologically active portion of CARD- 3 
or CARD-4. The nucleotide sequence determined from the 
35 cloning of the human CARD-3 or CARD-4, and the partial 
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murine CARD-4 gene allows for the generation of probes 
and primers designed for use in identifying and/or 
cloning CARD-3 or CARD-4 homologues in other cell types, 
e.g., from other tissues, as well as CARD-3 or CARD-4 
5 homologues and orthologs from other mammals. The 

probe/primer typically comprises substantially purified 
oligonucleotide. The oligonucleotide typically comprises 
a region of nucleotide sequence that hybridizes under 
stringent conditions to at least about 12, preferably 

10 about 25, more preferably about 50, 75, 100, 125, 150, 

175, 200, 250, 300, 350 or 400 consecutive nucleotides of 
the sense or ant i -sense sequence of SEQ ID NO:l, SEQ ID 

NO: 3, SEQ ID NO : 7 , SEQ ID NO : 9 , the cDNA of ATCC , 

the cDNA of ATCC or of a naturally occurring mutant 

15 of SEQ ID N0:1, SEQ ID NO : 3 , SEQ ID NO : 7 , SEQ ID NO : 9 , 

the cDNA of ATCC , or the cDNA of ATCC . 

Probes based on the human CARD-3 or human CARD-4 
or murine CARD-4 nucleotide sequence can be used to 
detect transcripts or genomic sequences encoding the same 

20 or identical proteins. The probe comprises a label group 
attached thereto, e.g., a radioisotope, a fluorescent 
compound, an enzyme, or an enzyme co-factor. Such probes 
can be used as a part of a diagnostic test kit for 
identifying allelic variants and orthologs of the CARD-3 

25 and CARD-4 proteins of the present invention, identifying 
cells or tissue which mis-express a CARD-3 or CARD-4 
protein, such as by measuring a level of a CARD-3 or 
CARD-4 -encoding nucleic acid in a sample of cells from a 
subject, e.g., detecting CARD-3 or CARD-4 mRNA levels or 

3 0 determining whether a genomic CARD-3 or CARD-4 gene has 
been mutated or deleted. 

A nucleic acid fragment encoding a "biologically 
active portion of CARD-3 or CARD-4L" can be prepared by 
isolating a portion of SEQ ID NO:l, SEQ ID NO: 3, SEQ ID 

3 5 NO: 7, SEQ ID NO : 9 , or the nucleotide sequence of the cDNA 
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of ATCC , or the nucleotide sequence of the cDNA of 

ATCC which encodes a polypeptide having a CARD-3 or 

CARD-4 biological activity, expressing the encoded 
portion of CARD-3 or CARD-4 protein (e.g., by recombinant 
5 expression in vitro) and assessing the activity of the 
encoded portion of CARD-3 or CARD-4. For example, a 
nucleic acid fragment encoding a biologically active 
portion of CARD-3 or CARD-4 includes a CARD domain, e.g., 
SEQ ID NO: 6 and SEQ ID NO: 10 or SEQ ID NO: 26. 

10 The invention further encompasses nucleic acid 

molecules that differ from the nucleotide sequence of SEQ 
ID NO:l, SEQ ID NO : 3 , SEQ ID NO : 7 , SEQ ID NO : 9 , SEQ ID 
NO: 25, SEQ ID NO : 2 7 , SEQ ID NO: 38, SEQ ID NO: 40, SEQ ID 
NO: 42, the cDNA of ATCC or the cDNA of ATCC 

15 due to degeneracy of the genetic code and thus encode the 
same CARD-3 or CARD-4 protein as that encoded by the 
nucleotide sequence shown in SEQ ID NO:l, SEQ ID NO : 3 , 
SEQ ID NO: 7, SEQ ID NO : 9 , SEQ ID NO: 25, SEQ ID NO: 27, SEQ 
ID NO: 38, SEQ ID NO : 4 0 , SEQ ID NO: 42, the cDNA of ATCC 

2 0 or the cDNA of ATCC . 

In addition to the human CARD-3 or CARD-4 
nucleotide sequence shown in SEQ ID NO:l, SEQ ID NO : 3 , 
SEQ ID NO:7, SEQ ID NO : 9 , SEQ ID NO:25, SEQ ID NO:27, SEQ 
ID NO: 38, SEQ ID NO : 4 0 , the cDNA of ATCC , the cDNA 

2 5 of ATCC , or the cDNA of ATCC , and the murine 

CARD-4L cDNA sequence shown in SEQ ID NO: 42 it will be 
appreciated by those skilled in the art that DNA sequence 
polymorphisms that lead to changes in the amino acid 
sequences of CARD-3 or CARD-4 may exist within a 

30 population (e.g., the human population) . Such genetic 

polymorphism in the CARD-3 or CARD-4 gene may exist among 
individuals within a population due to natural allelic 
variation. As used herein, the terms "gene" and 
'•recombinant gene" refer to nucleic acid molecules 

35 comprising an open reading frame encoding a CARD-3 or 
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CARD-4 protein, preferably a mammalian CARD- 3 or CARD-4 
protein. Such natural allelic variations can typically 
result in 1-5% variance in the nucleotide sequence of the 
CARD-3 or CARD-4 gene. Any and all such nucleotide 
5 variations and resulting amino acid polymorphisms in 
CARD-3 or CARD-4 that are the result of natural allelic 
variation and that do not alter the functional activity 
of CARD-3 or CARD-4 are intended to be within the scope 
of the invention . 

10 Moreover, nucleic acid molecules encoding CARD-3 

or CARD-4 proteins from other species (CARD-3 or CARD-4 
orthologs /homologues ) , which have a nucleotide sequence 
which differs from that of a human CARD-3 or CARD-4, are 
intended to be within the scope of the invention. For 

15 example, Example 5 describes the murine CARD-4 ortholog. 
Nucleic acid molecules corresponding to natural allelic 
variants and homologues of the CARD-3 or CARD-4 cDNA of 
the invention can be isolated based on their identity to 
the human CARD-3 or human or murine CARD-4 nucleic acids 

20 disclosed herein using the human or murine cDNAs , or a 
portion thereof, as a hybridization probe according to 
standard hybridization techniques under stringent 
hybridization conditions. In general, an allelic variant 
of a gene will be readily identifiable as mapping to the 

25 same chromosomal location as said gene. For example, in 
Example 6, the chromosomal location of the human CARD-4 
gene is discovered to be chromosome 7 close to the 
SHGC-31928 genetic marker. Allelic variants of human 
CARD-4 will be readily identifiable as mapping to the 

3 0 human CARD-4 locus on chromosome 7 near genetic marker 
SHGC-31928. 

Accordingly, in another embodiment, an isolated 
nucleic acid molecule of the invention is at least 300 
(325, 350, 375, 400, 425, 450, 500, 550, 600, 650, 700, 
35 800, 900, 1000, 1300, 1600 or 1931) nucleotides in length 
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and hybridizes under stringent conditions to the nucleic 
acid molecule comprising the nucleotide sequence, 
preferably the coding sequence, of SEQ ID NO : 1 , SEQ ID 

NO: 3, or the cDNA of ATCC . In yet another 

5 embodiment, an isolated nucleic acid molecule of the 

invention is at least 300 (325, 350, 375, 400, 425, 450, 
500, 550, 600, 650, 700, 800, 900, 1000, or 1300, 1640, 
1900, 2200, 2500, 2800, 3100, or 3382) nucleotides in 
length and hybridizes under stringent conditions to the 

10 nucleic acid molecule comprising the nucleotide sequence, 
preferably the coding sequence, of SEQ ID NO : 7 , SEQ ID 
NO: 9, SEQ ID NO: 38, SEQ ID NO: 40, or the CDNA of ATCC 

. Accordingly, in another embodiment, an isolated 

nucleic acid molecule of the invention is at least 300 

15 (325, 350, 375, 400, 425, 450, 500, 550, 600, 650, 700, 
800, 900, 1000, or 1300, 1640, 1900, 2200, 2500, 2800, 
3100, 3300, 3600, 3900, 4200 or 4209) nucleotides in 
length and hybridizes under stringent conditions to the 
nucleic acid molecule comprising the nucleotide sequence, 

2 0 preferably the coding sequence, of SEQ ID NO: 42. 

As used herein, the term "hybridizes under 
stringent conditions" is intended to describe conditions 
for hybridization and washing under which nucleotide 
sequences at least 60% (65%, 70%, preferably 75%) 

25 identical to each other typically remain hybridized to 

each other. Such stringent conditions are known to those 
skilled in the art and can be found in Current Protocols 
in Molecular Biology, John Wiley & Sons, N.Y. (1989), 
6.3.1-6.3.6. An, non-limiting example of stringent 

30 hybridization conditions are hybridization in 6X sodium 
chloride/sodium citrate (SSC) at about 45?C, followed by 
one or more washes in 0.2 X SSC, 0.1% SDS at 50-65?C. 
Preferably, an isolated nucleic acid molecule of the 
invention that hybridizes under stringent conditions to 

3 5 the sequence of SEQ ID NO:l, SEQ ID NO: 3, the cDNA of 
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ATCC corresponds to a naturally-occurring nucleic 

acid molecule. As used herein, a "naturally-occurring" 
nucleic acid molecule refers to an RNA or DNA molecule 
having a nucleotide sequence that occurs in nature (e.g., 
5 encodes a natural protein) . 

In addition to naturally-occurring allelic 
variants of the CARD- 3 or CARD-4 sequence that may exist 
in the population, the skilled artisan will further 
appreciate that changes can be introduced by mutation 

10 into the nucleotide sequence of SEQ ID NO:l, SEQ ID NO: 3, 
SEQ ID NO: 7, SEQ ID NO : 9 , SEQ ID NO: 25, SEQ ID NO: 26, SEQ 
ID NO:38, SEQ ID NO:40, SEQ ID NO:42, the cDNA of ATCC 

, the cDNA of ATCC , or the cDNA of ATCC 

, thereby leading to changes in the amino acid 

15 sequence of the encoded CARD -3 , CARD- 4 L/ S protein, CARD-4 
splice variant, or murine CARD-4 without altering the 
functional ability of the CARD- 3 , CARD-4L/S, CARD-4 
splice variant, or murine CARD-4 protein. For example, 
one can make nucleotide substitutions leading to amino 

2 0 acid substitutions at "non-essential" amino acid 

residues. A "non-essential" amino acid residue is a 
residue that can be altered from the wild-type sequence 
of CARD- 3 , CARD-4L/S, CARD-4 splice variant, or murine 
CARD-4 protein (e.g., the sequence of SEQ ID NO:2, SEQ ID 
25 NO: 8, SEQ ID NO:26, SEQ ID NO:39, SEQ ID NO:41 and SEQ ID 
NO: 43) without altering the biological activity, whereas 
an "essential" amino acid residue is required for 
biological activity. For example, amino acid residues 
that are conserved among the CARD- 3 , CARD-4L/S, CARD-4 

3 0 splice variant, or murine CARD-4 proteins of various 

species are predicted to be particularly unamenable to 
alteration. 

For example, preferred CARD- 3 or CARD-4 proteins 
of the present invention, contain at least one CARD 
35 domain. Additionally, a CARD-3 protein also contains at 
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least one kinase domain or at least one linker domain. A 
CARD domain contains at least one nucleotide binding 
domain or Leucine-rich repeats. Such conserved domains 
are less likely to be amenable to mutation. Other amino 
5 acid residues, however, (e.g., those that are not 

conserved or only semi -conserved among CARD -3 or CARD -4 
of various species) may not be essential for activity and 
thus are likely to be amenable to alteration. 

Accordingly, another aspect of the invention 

10 pertains to nucleic acid molecules encoding CARD-3 or 
CARD-4 proteins that contain changes in amino acid 
residues that are not essential for activity. Such 
CARD-3 or CARD-4 proteins differ in amino acid sequence 
from SEQ ID NO : 2 , SEQ ID NO : 8 , SEQ ID NO: 25, SEQ ID 

15 NO: 39, SEQ ID NO : 4 1 , or SEQ ID NO: 43 and yet retain 
biological activity. In one embodiment, the isolated 
nucleic acid molecule includes a nucleotide sequence 
encoding a protein that includes an amino acid sequence 
that is at least about 45% identical, 65%, 75%, 85%, 95%, 

20 or 98% identical to the amino acid sequence of SEQ ID 
NO:2, SEQ ID NO : 8 , SEQ ID NO:26, SEQ ID NO:39, SEQ ID 
NO: 41 or SEQ ID NO: 43. 

An isolated nucleic acid molecule encoding a 
CARD-3 or CARD-4 proteins having a sequence which differs 

25 from that of SEQ ID NO : 2 , SEQ ID NO : 8 , SEQ ID NO: 26, SEQ 
ID NO: 39, SEQ ID NO : 4 1 or SEQ ID NO: 43, can be created by 
introducing one or more nucleotide substitutions, 
additions or deletions into the nucleotide sequence of 
CARD-3 (SEQ ID NO : 1 , SEQ ID NO : 3 , the cDNA of ATCC ) 

30 or CARD- 4 L (SEQ ID NO : 7 , SEQ ID NO : 9 , the cDNA of 

ATCC ), or CARD- 4 S (SEQ ID NO:25, SEQ ID NO:27, the 

cDNA of ATCC ) , or human CARD-4 splice variants (SEQ 

ID NO: 38, SEQ ID NO : 4 0 , the cDNA of ATCC or the cDNA 

of ATCC ) , or murine CARD-4 (SEQ ID NO: 42) such that 

35 one or more amino acid substitutions, additions or 
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deletions are introduced into the encoded protein. 
Mutations can be introduced by standard techniques, such 
as site-directed mutagenesis and PCR-mediated 
mutagenesis. Preferably, conservative amino acid 
5 substitutions are made at one or more predicted 

non-essential amino acid residues. A "conservative amino 
acid substitution" is one in which the amino acid residue 
is replaced with an amino acid residue having a similar 
side chain. Families of amino acid residues having 

10 similar side chains have been defined in the art. These 
families include amino acids with basic side chains 
(e.g. , lysine, arginine, histidine) , acidic side chains 
(e.g. , aspartic acid, glutamic acid) , uncharged polar 
side chains (e.g., glycine, asparagine, glutamine, 

15 serine, threonine, tyrosine, cysteine) , nonpolar side 
chains (e.g., alanine, valine, leucine, isoleucine, 
proline, phenylalanine, methionine, tryptophan) , 
beta-branched side chains (e.g., threonine, valine, 
isoleucine) and aromatic side chains (e.g., tyrosine, 

20 phenylalanine, tryptophan, histidine) . Thus, a predicted 
nonessential amino acid residue in CARD-3 or CARD-4 is 
preferably replaced with another amino acid residue from 
the same side chain family. Alternatively, mutations can 
be introduced randomly along all or part of a CARD- 3 or 

2 5 CARD- 4 coding sequence, such as by saturation 

mutagenesis, and the resultant mutants can be screened 
for CARD-3 or CARD-4 biological activity to identify 
mutants that retain activity. Following mutagenesis, the 
encoded protein can be expressed recombinantly and the 

3 0 activity of the protein can be determined. 

In an embodiment, a mutant CARD-3 or CARD-4 
protein can be assayed for: (1) the ability to form 
protein: protein interactions with proteins in the 
apoptotic signalling pathway; (2) the ability to bind a 
35 CARD-3 or CARD-4 ligand; or (3) the ability to bind to an 
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intracellular target protein. For example, (1) in 
Example 7, a two-hybrid screening assay for the physical 
interaction of CARD- 3 and CARD- 4 is shown, (2) in Example 
8, a two-hybrid system assay for the interaction between 
5 CARD-4 and its ligand hNUDC is described, and (3) in 
Example 12, a coimmunoprecipitat ion assay for the 
interaction of CARD-3 with its ligand CARD-4 is shown. 
In yet another embodiment, a mutant CARD-3 or CARD-4 
protein can be assayed for the ability to modulate 

10 cellular proliferation, cellular differentiation, or 
cellular death. For example, in Example 10, assays for 
the regulation of cellular death (apoptosis) by CARD-3 or 
CARD-4 are described. In yet another embodiment, a 
mutant CARD-3 or CARD-4 protein can be assayed for 

15 regulation of a cellular signal transduction pathway. 

For example, in Example 9, an assay for the regulation by 
CARD-4 of the NF-kB pathway is described. 

The present invention encompasses antisense 
nucleic acid molecules, i.e., molecules which are 

20 complementary to a sense nucleic acid encoding a protein, 
e.g., complementary to the coding strand of a 
double -stranded cDNA molecule or complementary to an mRNA 
sequence. Accordingly, an antisense nucleic acid can 
hydrogen bond to a sense nucleic acid. The antisense 

2 5 nucleic acid can be complementary to an entire CARD-3 or 
CARD-4 coding strand, or to only a portion thereof, e.g., 
all or part of the protein coding region (or open reading 
frame) . An antisense nucleic acid molecule can be 
antisense to a noncoding region of the coding strand of a 

30 nucleotide sequence encoding CARD-3 or CARD-4. The 

noncoding regions ("5' and 3' untranslated regions") are 
the 5' and 3' sequences which flank the coding region and 
are not translated into amino acids. 

Given the coding strand sequences encoding CARD-3 

35 or CARD-4 disclosed herein (e.g., SEQ ID NO:l, SEQ ID 



WO 99/40102 



PCT/US99/02544 



NO: 3, SEQ ID NO : 7 , SEQ ID NO : 9 , SEQ ID NO: 25, SEQ ID 
NO:27, SEQ ID NO : 3 8 , SEQ ID NO:40, or SEQ ID NO:42), 
antisense nucleic acids of the invention can be designed 
according to the rules of Watson and Crick base pairing. 
5 The antisense nucleic acid molecule can be complementary 
to the entire coding region of CARD-3 or CARD-4L/S mRNA, 
but more preferably is an oligonucleotide which is 
antisense to only a portion of the coding or noncoding 
region of CARD-3 or CARD-4 mRNA. For example, the 
10 antisense oligonucleotide can be complementary to the 
region surrounding the translation start site of CARD-3 
mRNA, e.g., an oligonucleotide having the sequence 
CCCTGGTACTTGCCCCTCCGGTAG (SEQ ID NO: 34) or 

CCTGGTACTTGCCCCTCC (SEQ ID NO: 35) or of the CARD-4L mRNA 

15 e.g., TCGTTAAGCCCTTGAAGACAGTG (SEQ ID NO: 36) and 

TCGTTAGCCCTTGAAGACCAGTGAGTGTAG (SEQ ID NO: 37) . An 
antisense oligonucleotide can be, for example, about 5, 
10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in 
length. An antisense nucleic acid of the invention can 

20 be constructed using chemical synthesis and enzymatic 
ligation reactions using procedures known in the art. 
For example, an antisense nucleic acid (e.g., an 
antisense oligonucleotide) can be chemically synthesized 
using naturally occurring nucleotides or variously 

25 modified nucleotides designed to increase the biological 
stability of the molecules or to increase the physical 
stability of the duplex formed between the antisense and 
sense nucleic acids, e.g., phosphorothioate derivatives 
and acridine substituted nucleotides can be used. 

3 0 Examples of modified nucleotides which can be used to 
generate the antisense nucleic acid include 
5-f luorouracil , 5-bromouracil , 5-chlorouracil , 
5 - iodouracil , hypoxanthine , xanthine , 4 - acetyl cytosine , 
5- (carboxyhydroxylmethyl) uracil, 

3 5 5 - carboxymethy laminomethyl - 2 - 1 hiour idine , 
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5-carboxymethylaminomethyluracil , dihydrouracil , 
beta -D-ga lac tosylqueosine , inosine , 

N6 - isopentenyladenine , 1 -methylguanine , 1 -methylinosine , 
2 , 2 -dimethylguanine , 2 -methyladenine , 2 -methylguanine , 
5 3 -methyl cytosine , 5 -methylcytosine , N6-adenine, 
7 -methylguanine , 5-methylaminomethyluracil , 
5 -methoxyaminomethyl -2 - thiouracil , 

beta-D-mannosylqueosine , 5 ' -methoxycarboxymethyluracil , 
5 -methoxyuracil , 2 -methyl thio-N6 - isopentenyladenine , 

10 uracil - 5 -oxyacetic acid (v) , wybutoxosine , pseudouracil , 
queosine , 2 - thiocytosine , 5 -methyl - 2 - thiouracil , 
2 - thiouracil , 4 -thiouracil , 5 -methyl uracil , 
uracil - 5 -oxyacetic acid methyl ester, uracil -5 -oxyacetic 
acid (v) , 5-methyl-2 -thiouracil , 

15 3- ( 3 - amino - 3 - N - 2 - carboxypropy 1 ) uracil, (acp3) w, and 

2 , 6 -diaminopurine . Alternatively, the antisense nucleic 
acid can be produced biologically using an expression 
vector into which a nucleic acid has been subcloned in an 
antisense orientation (i.e., RNA transcribed from the 

20 inserted nucleic acid will be of an antisense orientation 
to a target nucleic acid of interest, described further 
in the following subsection) . 

The antisense nucleic acid molecules of the 
invention are typically administered to a subject or 

25 generated in situ such that they hybridize with or bind 
to cellular mRNA and/or genomic DNA encoding a CARD- 3 or 
CARD-4 protein to thereby inhibit expression of the 
protein, e.g., by inhibiting transcription and/or 
translation. The hybridization can be by conventional 

30 nucleotide complementarity to form a stable duplex, or, 
for example, in the case of an antisense nucleic acid 
molecule which binds to DNA duplexes, through specific 
interactions in the major groove of the double helix. An 
example of a route of administration of antisense nucleic 

35 acid molecules of the invention include direct injection 
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at a tissue site. Alternatively, antisense nucleic acid 
molecules can be modified to target selected cells and 
then administered systemically . For example, for 
systemic administration, antisense molecules can be 
5 modified such that they specifically bind to receptors or 
antigens expressed on a selected cell surface, e.g., by 
linking the antisense nucleic acid molecules to peptides 
or antibodies which bind to cell surface receptors or 
antigens. The antisense nucleic acid molecules can also 

10 be delivered to cells using the vectors described herein. 
To achieve sufficient intracellular concentrations of the 
antisense molecules, vector constructs in which the 
antisense nucleic acid molecule is placed under the 
control of a strong pol II or pol III promoter are 

15 preferred. 

An antisense nucleic acid molecule of the 
invention can be an a-anomeric nucleic acid molecule. An 
a-anomeric nucleic acid molecule forms specific 
double- stranded hybrids with complementary RNA in which, 

2 0 contrary to the usual £-units, the strands run parallel 
to each other (Gaultier et al . (1987) Nucleic Acids. Res. 
15:6625-6641). The antisense nucleic acid molecule can 
also comprise a 2 ' -o-methylribonucleotide (Inoue et al . 
(1987) Nucleic Acids Res. 15:6131-6148) or a chimeric 

2 5 RNA-DNA analogue (Inoue et al . (1987) FEBS Lett. 

215:327-330) . 

The invention also encompasses ribozymes . 
Ribozymes are catalytic RNA molecules with ribonuclease 
activity which are capable of cleaving a single -stranded 

3 0 nucleic acid, such as an mRNA, to which they have a 

complementary region. Thus, ribozymes (e.g., hammerhead 
ribozymes (described in Haselhoff and Gerlach (1988) 
Nature 334:585-591)) can be used to catalytically cleave 
CARD- 3 or CARD-4 mRNA transcripts to thereby inhibit 
35 translation of CARD-3 or CARD-4 mRNA. A ribozyme having 
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specificity for a CARD - 3 or CARD- 4 -encoding nucleic acid 
can be designed based upon the nucleotide sequence of a 
CARD-3 or CARD-4 cDNA disclosed herein (e.g., SEQ ID 
NO:l, SEQ ID NO : 3 , SEQ ID NO : 7 , SEQ ID NO : 9 , SEQ ID 
5 NO:25, SEQ ID NO:27, SEQ ID NO:38, SEQ ID NO : 4 0 , and SEQ 
ID NO: 42) . For example, a derivative of a Tetrahymena 
L-19 IVS RNA can be constructed in which the nucleotide 
sequence of the active site is complementary to the 
nucleotide sequence to be cleaved in a CARD-3 or 

10 CARD-4 -encoding mRNA. See, e.g., Cech et al . U.S. Patent 
No. 4,987,071; and Cech et al . U.S. Patent No. 5,116,742. 
Alternatively, CARD-3 or CARD-4 mRNA can be used to 
select a catalytic RNA having a specific ribonuclease 
activity from a pool of RNA molecules. See, e.g., Bartel 

15 and Szostak (1993) Science 261:1411-1418. 

The invention also encompasses nucleic acid 
molecules which form triple helical structures. For 
example, CARD-3 or CARD-4 gene expression can be 
inhibited by targeting nucleotide sequences complementary 

20 to the regulatory region of the CARD-3 or CARD-4 (e.g., 
the CARD-3 or CARD-4 promoter and/or enhancers) to form 
triple helical structures that prevent transcription of 
the CARD-3 or CARD-4 gene in target cells. See 
generally, Helene (1991) Anticancer Drug Des . 

25 6(6):569-84; Helene (1992) Ann. N.Y. Acad. Sci . 

660:27-36; and Maher (1992) Bioassays 14 (12) : 807-15 . 

In embodiments, the nucleic acid molecules of the 
invention can be modified at the base moiety, sugar 
moiety or phosphate backbone to improve, e.g., the 

30 stability, hybridization, or solubility of the molecule. 
For example, the deoxyribose phosphate backbone of the 
nucleic acids can be modified to generate peptide nucleic 
acids (see Hyrup et al . (1996) Bioorganic & Medicinal 
Chemistry 4(1) : 5-23) . As used herein, the terms 

35 "peptide nucleic acids" or "PNAs" refer to nucleic acid 
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mimics, e.g., DNA mimics, in which the deoxyribose 
phosphate backbone is replaced by a pseudopeptide 
backbone and only the four natural nucleobases are 
retained. The neutral backbone of PNAs has been shown to 
5 allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. The synthesis of PNA 
oligomers can be performed using standard solid phase 
peptide synthesis protocols as described in Hyrup et al . 
(1996) supra; Perry-O' Keefe et al . (1996) Proc . Natl. 

10 Acad. Sci. USA 93: 14670-675. 

PNAs of CARD- 3 or CARD -4 can be used for 
therapeutic and diagnostic applications. For example, 
PNAs can be used as antisense or antigene agents for 
sequence-specific modulation of gene expression by, e.g., 

15 inducing transcription or translation arrest or 

inhibiting replication. PNAs of CARD-3 or CARD-4 can 
also be used, e.g., in the analysis of single base pair 
mutations in a gene by, e.g., PNA directed PCR clamping; 
as 'artificial restriction enzymes when used in 

20 combination with other enzymes, e.g., SI nucleases (Hyrup 
(1996) supra; or as probes or primers for DNA sequence 
and hybridization (Hyrup (1996) supra; Perry-O' Keefe et 
al. (1996) Proc. Natl. Acad. Sci. USA 93: 14670-675). 

In another embodiment, PNAs of CARD-3 or CARD-4 

25 can be modified, e.g., to enhance their stability or 

cellular uptake, by attaching lipophilic or other helper 
groups to PNA, by the formation of PNA -DNA chimeras, or 
by the use of liposomes or other techniques of drug 
delivery known in the art. For example, PNA -DNA chimeras 

3 0 of CARD-3 or CARD-4 can be generated which may combine 
the advantageous properties of PNA and DNA. Such 
chimeras allow DNA recognition enzymes, e.g., RNAse H and 
DNA polymerases, to interact with the DNA portion while 
the PNA portion would provide high binding affinity and 

35 specificity. PNA -DNA chimeras can be linked using 
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linkers of appropriate lengths selected in terms of base 
stacking, number of bonds between the nucleobases, and 
orientation (Hyrup (1996) supra) . The synthesis of 
PNA - DNA chimeras can be performed as described in Hyrup 
5 (1996) supra and Finn et al . (1996) Nucleic Acids 

Research 24(17) : 3357-63. For example, a DNA chain can be 
synthesized on a solid support using standard 
phosphoramidite coupling chemistry and modified 
nucleoside analogs , e.g., 

10 5' - (4 -methoxytrityl ) amino- 5' -deoxy- thymidine 

phosphoramidite, can be used as a between the PNA and the 
5' end of DNA (Mag et al . (1989) Nucleic Acid Res. 
17:5973-88) . PNA monomers are then coupled in a stepwise 
manner to produce a chimeric molecule with a 5' PNA 

15 segment and a 3' DNA segment (Finn et al . (1996) Nucleic 
Acids Research 24 ( 17 ) : 3 3 57 - 63 ) . Alternatively, chimeric 
molecules can be synthesized with a 5' DNA segment and a 
3' PNA segment (Peterser et al . (1975) Bioorganic Med. 
Chem. Lett. 5:1119-11124) . 

20 In other embodiments, the oligonucleotide may 

include other appended groups such as peptides (e.g., for 
targeting host cell receptors in vivo) , or agents 
facilitating transport across the cell membrane (see, 
e.g., Letsinger et al . (1989) Proc . Natl. Acad. Sci . USA 

25 86:6553-6556; Lemaitre et al . (1987) Proc. Natl. Acad. 

Sci. USA 84:648-652; PCT Publication No. WO 88/09810) or 
the blood-brain barrier (see, e.g., PCT Publication No. 
W0 89/10134) . In addition, oligonucleotides can be 
modified with hybridization- triggered cleavage agents 

30 (see, e.g., Krol et al . (1988) Bio/Techniques 6:958-976) 
or intercalating agents (see, e.g., Zon (1988) Pharm. 
Res. 5:539-549). To this end, the oligonucleotide may be 
conjugated to another molecule, e.g., a peptide, 
hybridization triggered cross-linking agent, transport 

3 5 agent, hybridization- triggered cleavage agent, etc. 
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II. Isolated CARD- 3 or CARD -4 Proteins and Anti - CARD- 3 
or CARD-4 Antibodies. 

One aspect of the invention pertains to isolated 
CARD-3 or CARD-4 proteins, and biologically active 
5 portions thereof, as well as polypeptide fragments 

suitable for use as immunogens to raise anti -CARD- 3 or 
CARD-4 antibodies. In one embodiment, native CARD-3 or 
CARD-4 proteins can be isolated from cells or tissue 
sources by an appropriate purification scheme using 

10 standard protein purification techniques. In another 
embodiment, CARD-3 or CARD-4 proteins are produced by 
recombinant DNA techniques. Alternative to recombinant 
expression, a CARD-3 or CARD-4 protein or polypeptide can 
be synthesized chemically using standard peptide 

15 synthesis techniques. 

An "isolated" or "purified" protein or 
biologically active portion thereof is substantially free 
of cellular material or other contaminating proteins from 
the cell or tissue source from which the CARD-3 or CARD-4 

2 0 protein is derived, or substantially free from chemical 

precursors or other chemicals when chemically 
synthesized. The language "substantially free of 
cellular material" includes preparations of CARD-3 or 
CARD-4 protein in which the protein is separated from 

25 cellular components of the cells from which it is 

isolated or recombinant ly produced. Thus, CARD-3 or 
CARD-4 protein that is substantially free of cellular 
material includes preparations of CARD-3 or CARD-4 
protein having less than about 30%, 20%, 10%, or 5% (by 

30 dry weight) of non-CARD-3 or CARD-4 protein (also 

referred to herein as a "contaminating protein") . When 
the CARD-3 or CARD-4 protein or biologically active 
portion thereof is recombinantly produced, it is also 
preferably substantially free of culture medium, i.e., 

3 5 culture medium represents less than about 20%, 10%, or 5% 
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of the volume of the protein preparation. When CARD - 3 or 
CARD-4 protein is produced by chemical synthesis, it is 
preferably substantially free of chemical precursors or 
other chemicals, i.e., it is separated from chemical 
5 precursors or other chemicals which are involved in the 
synthesis of the protein. Accordingly such preparations 
of CARD- 3 or CARD-4 protein have less than about 3 0%, 
20%, 10%, 5% (by dry weight) of chemical precursors or 
non-CARD-3 or CARD-4 chemicals. 

10 Biologically active portions of a CARD-3 or CARD-4 

protein include peptides comprising amino acid sequences 
sufficiently identical to or derived from the amino acid 
sequence of the CARD-3 or CARD-4 protein (e.g., the amino 
acid sequence shown in SEQ ID NO: 2, SEQ ID NO: 8, SEQ ID 

15 NO: 26, SEQ ID NO : 3 9 , SEQ ID NO: 41 or SEQ ID NO: 43) , which 
include less amino acids than the full length CARD-3 or 
CARD-4 proteins, and exhibit at least one activity of a 
CARD-3 or CARD-4 protein. Typically, biologically active 
portions comprise a domain or motif with at least one 

20 activity of the CARD-3 or CARD-4 protein. A biologically 
active portion of a CARD-3 or CARD-4 protein can be a 
polypeptide which is, for example, 10, 25, 50, 100 or 
more amino acids in length. Preferred biologically 
active polypeptides include one or more identified CARD-3 

25 or CARD-4 structural domains, e.g., the CARD domain (SEQ 
ID NO: 6 or SEQ ID NO: 10 or SEQ ID NO: 27) . 

Moreover, other biologically active portions, in 
which other regions of the protein are deleted, can be 
prepared by recombinant techniques and evaluated for one 

30 or more of the functional activities of a native CARD-3 
or CARD-4 protein. 

CARD-3 or CARD-4 protein has the amino acid 
sequence shown of SEQ ID NO: 2, SEQ ID NO : 8 , SEQ ID NO: 26, 
SEQ ID NO: 39, SEQ ID NO : 4 1 or SEQ ID NO: 43. Other useful 

35 CARD-3 or CARD-4 proteins are substantially identical to 
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SEQ ID NO: 2 or SEQ ID NO: 8 or SEQ ID NO: 26, SEQ ID NO: 3 9 
or SEQ ID NO: 41 or SEQ ID NO: 43 and retain the functional 
activity of the protein of SEQ ID NO: 2 or SEQ ID NO : 8 or 
SEQ ID NO: 26, SEQ ID NO: 39 or SEQ ID NO: 41 or SEQ ID 
5 NO: 43 yet differ in amino acid sequence due to natural 
allelic variation or mutagenesis. CARD-3 and CARD-4 are 
involved in activating caspases in the apoptotic pathway. 
For example, in Example 10, CARD-4 is shown to enhance 
caspase 9 activity. Accordingly, a useful CARD-3 or 

10 CARD-4 protein is a protein which includes an amino acid 
sequence at least about 45%, preferably 55%, 65%, 75%, 
85%, 95%, or 99% identical to the amino acid sequence of 
SEQ ID NO: 2 or SEQ ID NO: 8 or SEQ ID NO: 26, SEQ ID NO:39 
or SEQ ID NO: 41 or SEQ ID NO: 43 and retains the 

15 functional activity of the CARD-3 or CARD-4 proteins of 
SEQ ID NO: 2 or SEQ ID NO : 8 or SEQ ID NO: 26, SEQ ID NO: 39 
or SEQ ID NO: 41 or SEQ ID NO: 43. In other instances, the 
CARD-3 or CARD-4 protein is a protein having an amino 
acid sequence 55%, 65%, 75%, 85%, 95%, or 98% identical 

20 to the CARD-3 or CARD-4L CARD domain (SEQ ID NO: 6, SEQ ID 
NO:10 and SEQ ID NO:27). In an embodiment, the CARD-3 or 
CARD-4 protein retains a functional activity of the 
CARD-3 or CARD-4 protein of SEQ ID NO: 2, SEQ ID NO: 8 or 
SEQ ID NO: 26, SEQ ID NO: 39 or SEQ ID NO: 41 or SEQ ID 

25 NO:43. 

To determine the percent identity of two amino 
acid sequences or of two nucleic acids, the sequences are 
aligned for optimal comparison purposes (e.g., gaps can 
be introduced in the sequence of a first amino acid or 

30 nucleic acid sequence for optimal alignment with a second 
amino or nucleic acid sequence) . The amino acid residues 
or nucleotides at corresponding amino acid positions or 
nucleotide positions are then compared. When a position 
in the first sequence is occupied by the same amino acid 

35 residue or nucleotide as the corresponding position in 
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the second sequence, then the molecules are identical at 
that position. The percent identity between the two 
sequences is a function of the number of identical 
positions shared by the sequences (i.e., % identity = # 
5 of identical positions/total # of positions x 100) . 

The determination of percent homology between two 
sequences can be accomplished using a mathematical 
algorithm. A preferred, non-limiting example of a 
mathematical algorithm utilized for the comparison of two 

10 sequences is the algorithm of Karlin and Altschul (1990) 
Proc. Nat'l Acad. Sci . USA 87:2264-2268, modified as in 
Karlin and Altschul (1993) Proc. Nat'l Acad. Sci. USA 
90:5873-5877. Such an algorithm is incorporated into the 
NBLAST and XBLAST programs of Altschul, et al . (1990) J. 

15 Mol . Biol. 215:403-410. BLAST nucleotide searches can be 
performed with the NBLAST program, score = 100, 
wordlength = 12 to obtain nucleotide sequences similar or 
homologous to CARD- 3 or CARD-4 nucleic acid molecules of 
the invention. For example, Example 5 describes the use 

2 0 of the TBLASTN program to query a database of sequences 

of full length and partial cDNA sequences with the human 
CARD-4 polypeptide sequence leading to the discovery of 
murine CARD-4 and Example 4 describes the use of BLASTN 
to query a proprietary EST database with the 5' 
25 untranslated sequence of CARD-4 leading to the discovery 
of two human CARD-4 splice variants. BLAST protein 
searches can be performed with the XBLAST program, score 
= 50, wordlength = 3 to obtain amino acid sequences 
homologous to CARD-3 or CARD-4 protein molecules of the 

3 0 invention. To obtain gapped alignments for comparison 

purposes, Gapped BLAST can be utilized as described in 
Altschul et al . , (1997) Nucleic Acids Res. 25:3389-3402. 
When utilizing BLAST and Gapped BLAST programs, the 
default parameters of the respective programs (e.g., 
3 5 XBLAST and NBLAST) can be used. See 
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http://www.ncbi-nlm.nih.gov. Another preferred, 
non-limiting example of a mathematical algorithm utilized 
for the comparison of sequences is the algorithm of Myers 
and Miller, CABIOS (1989) . Such an algorithm is 
5 incorporated into the ALIGN program (version 2.0) which 
is part of the GCG sequence alignment software package. 
When utilizing the ALIGN program for comparing amino acid 
sequences, a PAM12 0 weight residue table, a gap length 
penalty of 12, and a gap penalty of 4 can be used. 

10 The percent identity between two sequences can be 

determined using techniques similar to those described 
above, with or without allowing gaps. In calculating 
percent identity, only exact matches are counted. 

The invention also provides CARD-3 or CARD-4 

15 chimeric or fusion proteins. As used herein, a CARD-3 or 
CARD-4 "chimeric protein" or "fusion protein" comprises a 
CARD-3 or CARD-4 polypeptide operatively linked to a 
non-CARD-3 or CARD-4 polypeptide. A "CARD-3 or CARD-4 
polypeptide" refers to a polypeptide having an amino acid 

20 sequence corresponding to CARD-3 or CARD-4L/S, murine 
CARD-4 or human CARD-4 splice variants, whereas a 
"non-CARD-3 or CARD-4 polypeptide" refers to a 
polypeptide having an amino acid sequence corresponding 
to a protein which is not substantially identical to the 

25 CARD-3 or CARD- 4 L/ S protein, murine CARD-4, or human 

CARD-4 splice variants e.g., a protein which is different 
from the CARD-3 or CARD-4 proteins and which is derived 
from the same or a different organism. Within a CARD-3 
or CARD-4L fusion protein, the CARD-3 or CARD-4 

30 polypeptide can correspond to all or a portion of a 
CARD-3 or CARD-4 protein, preferably at least one 
biologically active portion of a CARD-3 or CARD-4 
protein. Within the fusion protein, the term 
"operatively linked" is intended to indicate that the 

35 CARD-3 or CARD-4 polypeptide and the non-CARD-3 or 
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nan -CARD -4 polypeptide are fused in- frame to each other. 
The non-CARD-3 or non-CARD-4 polypeptide can be fused to 
the N-terminus or C-terminus of the CARD-3 or CARD-4 
polypeptide . 

5 One useful fusion protein is a GST-CARD-3 or 

GST-CARD-4 fusion protein in which the CARD-3 or CARD-4 
sequences are fused to the C-terminus of the GST 
sequences. Such fusion proteins can facilitate the 
purification of recombinant CARD-3 or CARD-4. 

10 In another embodiment, the fusion protein contains 

a signal sequence from another protein. In certain host 
cells (e.g., mammalian host cells), expression and/or 
secretion of CARD-3 or CARD-4 can be increased through 
use of a heterologous signal sequence. For example, the 

15 gp67 secretory sequence of the baculovirus envelope 
protein can be used as a heterologous signal sequence 
(Current Protocols in Molecular Biology, Ausubel et al . , 
eds . , John Wiley & Sons, 1992). Other examples of 
eukaryotic heterologous signal sequences include the 

20 secretory sequences of melittin and human placental 

alkaline phosphatase (Stratagene; La Jolla, California) . 
In yet another example, useful prokaryotic heterologous 
signal sequences include the phoA secretory signal 
(Molecular cloning, Sambrook et al, second edition, Cold 

25 spring harbor laboratory press, 1989) and the protein A 
secretory signal (Pharmacia Biotech; Piscataway, New 
Jersey) . 

In yet another embodiment, the fusion protein is a 
CARD-3 or CARD-4 - immunoglobulin fusion protein in which 

30 all or part of CARD-3 or CARD-4 is fused to sequences 
derived from a member of the immunoglobulin protein 
family. The CARD-3 or CARD-4 - immunoglobulin fusion 
proteins of the invention can be incorporated into 
pharmaceutical compositions and administered to a subject 

35 to inhibit an interaction between a CARD-3 or CARD-4 
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ligand and a CARD- 3 or CARD-4 protein on the surface of a 
cell, to thereby suppress CARD-3 or CARD-4 -mediated 
signal transduction in vivo. The CARD-3 or 
CARD-4 - immunoglobulin fusion proteins can be used to 
5 affect the bioavailability of a CARD-3 or CARD-4 cognate 
ligand. Inhibition of the CARD-3 ligand/CARD-3 or CARD-4 
ligand/CARD-4 interaction may be useful therapeutically 
for both the treatment of prol i f erat ive and 
different iative disorders, as well as modulating (e.g. 

10 promoting or inhibiting) cell survival. Moreover, the 
CARD-3 or CARD-4 - immunoglobulin fusion proteins of the 
invention can be used as immunogens to produce 
anti -CARD-3 or CARD-4 antibodies in a subject, to purify 
CARD-3 or CARD-4 ligands and in screening assays to 

15 identify molecules which inhibit the interaction of 
CARD-3 or CARD-4 with a CARD-3 or CARD-4 ligand. 

Preferably, a CARD-3 or CARD-4 chimeric or fusion 
protein of the invention is produced by standard 
recombinant DNA techniques. For example, DNA fragments 

20 coding for the different polypeptide sequences are 

ligated together in- frame in accordance with conventional 
techniques, for example by employing blunt -ended or 
stagger-ended termini for ligation, restriction enzyme 
digestion to provide for appropriate termini, filling-in 

25 of cohesive ends as appropriate, alkaline phosphatase 
treatment to avoid undesirable joining, and enzymatic 
ligation. In another embodiment, the fusion gene can be 
synthesized by conventional techniques including 
automated DNA synthesizers. Alternatively, PCR 

3 0 amplification of gene fragments can be carried out using 
anchor primers which give rise to complementary overhangs 
between two consecutive gene fragments which can 
subsequently be annealed and reamplified to generate a 
chimeric gene sequence (see, e.g., Current Protocols in 

35 Molecular Biology, Ausubel et al . eds . , John Wiley & 
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Sons: 1992) . Moreover, many expression vectors are 
commercially available that already encode a fusion 
moiety (e.g., a GST polypeptide). A CARD-3 or 
CARD- 4 -encoding nucleic acid can be cloned into such an 
5 expression vector such that the fusion moiety is linked 
in-frame to the CARD-3 or CARD-4 protein. 

The. present invention also pertains to variants of 
the CARD-3 or CARD-4 proteins which function as either 
CARD-3 or CARD - 4 agonists (mimetics) or as CARD-3 or 
10 CARD-4 antagonists. Variants of the CARD-3 or CARD-4 
protein can be generated by mutagenesis, e.g., discrete 
point mutation or truncation of the CARD-3 or CARD-4 
protein. An agonist of the CARD-3 or CARD-4 protein can 
retain substantially the same, or a subset, of the 
15 biological activities of the naturally occurring form of 
the CARD-3 or CARD-4 protein. An antagonist of the 
CARD-3 or CARD-4 protein can inhibit one or more of the 
activities of the naturally occurring form of the CARD-3 
or CARD-4 protein by, for example, competitively binding 
20 to a downstream or upstream member of a cellular 

signaling cascade which includes the CARD-3 or CARD-4 
protein. Thus, specific biological effects can be 
elicited by treatment with a variant of limited function. 
Treatment of a subject with a variant having a subset of 
2 5 the biological activities of the naturally occurring form 
of the protein can have fewer side effects in a subject 
relative to treatment with the naturally occurring form 
of the CARD-3 or CARD-4 proteins. 

Variants of the CARD-3 or CARD-4 protein which 
30 function as either CARD-3 or CARD-4 agonists (mimetics) 
or as CARD-3 or CARD-4 antagonists can be identified by 
screening combinatorial libraries of mutants, e.g., 
truncation mutants of the CARD-3 or CARD-4 protein for 
CARD-3 or CARD-4 protein agonist or antagonist activity. 
35 In one embodiment, a variegated library of CARD-3 or 
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CARD-4 variants is generated by combinatorial mutagenesis 
at the nucleic acid level and is encoded by a variegated 
gene library. A variegated library of CARD-3 or CARD-4 
variants can be produced by, for example, enzymat ically 
5 ligating a mixture of synthetic oligonucleotides into 
g ene sequences such that a degenerate set of potential 
CARD-3 or CARD-4 sequences is expressible as individual 
polypeptides, or alternatively, as a set of larger fusion 
proteins (e.g., for phage display) containing the set of 

10 CARD-3 or CARD-4 sequences therein. There are a variety 
of methods which can be used to produce libraries of 
potential CARD-3 or CARD-4 variants from a degenerate 
oligonucleotide sequence. Chemical synthesis of a 
degenerate gene sequence can be performed in an automatic 

15 DNA synthesizer, and the synthetic gene then ligated into 
an appropriate expression vector. Use of a degenerate 
set of genes allows for the provision, in one mixture, of 
all of the sequences encoding the desired set of 
potential CARD-3 or CARD-4 sequences. Methods for 

20 synthesizing degenerate oligonucleotides are known in the 
art (see, e.g., Narang (1983) Tetrahedron 39:3; Itakura 
et al . (1984) Annu. Rev. Biochem. 53:323; Itakura et al . 
(1984) Science 198:1056; Ike et al . (1983) Nucleic Acid 
Res . 11 :477) . 

25 Useful fragments of CARD-3 and CARD-4 include 

fragments comprising or consisting of a domain or 
subdomain described herein, e.g., a kinase domain or a 
CARD domain. 

In addition, libraries of fragments of the CARD-3 
3 0 or CARD-4 protein coding sequence can be used to generate 
a variegated population of CARD-3 or CARD-4 fragments for 
screening and subsequent selection of variants of a 
CARD-3 or CARD-4 protein. In one embodiment, a library 
of coding sequence fragments can be generated by treating 
35 a double stranded PCR fragment of a CARD-3 or CARD-4 
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coding sequence with a nuclease under conditions wherein 
nicking occurs only about once per molecule, denaturing 
the double stranded DNA, renaturing the DNA to form 
double stranded DNA which can include sense/ant isense 
5 pairs from different nicked products, removing single 
stranded portions from reformed duplexes by treatment 
with SI nuclease, and ligating the resulting fragment 
library into an expression vector. By this method, an 
expression library can be derived which encodes 

10 N-terminal and internal fragments of various sizes of the 
CARD- 3 or CARD-4 protein. 

Several techniques are known in the art for 
screening gene products of combinatorial libraries made 
by point mutations or truncation, and for screening cDNA 

15 libraries for gene products having a selected property. 
Such techniques are adaptable for rapid screening of the 
gene libraries generated by the combinatorial mutagenesis 
of CARD-3 or CARD-4 proteins. The most widely used 
techniques, which are amenable to high through-put 

20 analysis, for screening large gene libraries typically 
include cloning the gene library into replicable 
expression vectors, transforming appropriate cells with 
the resulting library of vectors, and expressing the 
combinatorial genes under conditions in which detection 

25 of a desired activity facilitates isolation of the vector 
encoding the gene whose product was detected. Recursive 
ensemble mutagenesis (REM) , a technique which enhances 
the frequency of functional mutants in the libraries, can 
be used in combination with the screening assays to 

3 0 identify CARD-3 or CARD-4 variants (Arkin and Yourvan 

(1992) Proc. Natl. Acad. Sci . USA 8 9:7811-7815; Delgrave 
et al . (1993) Protein Engineering 6 (3) : 327-331) . 

An isolated CARD-3 or CARD-4 protein, or a portion 
or fragment thereof, can be used as an immunogen to 

35 generate antibodies that bind CARD-3 or CARD-4 using 
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standard techniques for polyclonal and monoclonal 
antibody preparation. The full-length CARD-3 or CARD-4 
protein can be used or, alternatively, the invention 
provides antigenic peptide fragments of CARD-3 or CARD-4 
5 for use as immunogens . The antigenic peptide of CARD-3 or 
CARD-4 comprises at least 8 (preferably 10, 15, 20, or 
30) amino acid residues of the amino acid sequence shown 
in SEQ ID NO: 2, SEQ ID NO : 8 or SEQ ID NO: 26, or SEQ ID 
NO: 39 or SEQ ID NO: 41 or SEQ ID NO: 43 or polypeptides 

10 including amino acids 128-139 or 287-298 of human CARD - 4 L 
and encompasses an epitope of CARD-3 or CARD-4 such that 
an antibody raised against the peptide forms a specific 
immune complex with CARD-3 or CARD-4. 

Useful antibodies include antibodies which bind to 

15 a domain or subdomain of CARD-3 or CARD-4 described 
hherein, e.g., a kinase domain or a CARD domain). 

Preferred epitopes encompassed by the antigenic 
peptide are regions of CARD-3 or CARD-4 that are located 
on the surface of the protein, e.g., hydrophilic regions.. 

2 0 Other important criteria include a preference for a 
terminal sequence, high antigenic index (e.g., as 
predicted by Jameson-Wolf algorithm) , ease of peptide 
synthesis (e.g., avoidance of prolines); and high surface 
probability (e.g., as predicted by the Emini algorithm; 

2 5 Figure 8 and Figure 9) . 

A CARD-3 or CARD-4 immunogen typically is used to 
prepare antibodies by immunizing a suitable subject, 
(e.g., rabbit, goat, mouse or other mammal) with the 
immunogen. An appropriate immunogenic preparation can 

3 0 contain, for example, recombinantly expressed CARD-3 or 

CARD-4 protein or a chemically synthesized CARD-3 or 
CARD-4 polypeptide. The preparation can further include 
an adjuvant, such as Freund's complete or incomplete 
adjuvant, or similar immunostimulatory agent. 
3 5 Immunization of a suitable subject with an immunogenic 
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CARD - 3 or CARD -4 preparation induces a polyclonal 
ant i- CARD -3 or CARD- 4 antibody response. For example, 
polypeptides including amino acids 128-139 or 287-298 of 
human CARD - 4 L were conjugated to KLH and the resulting 
5 conjugates were used to immunize rabbits and polyclonal 
antibodies that specifically recognize the two immunogen 
peptides were generated. 

Accordingly, another aspect of the invention 
pertains to ant i -CARD- 3 or CARD-4 antibodies. The term 
10 "antibody" as used herein refers to immunoglobulin 
molecules and immunologically active portions of 
immunoglobulin molecules, i.e., molecules that contain an 
antigen binding site which specifically binds an antigen, 
such as CARD-3 or CARD-4. A molecule which specifically 
15 binds to CARD-3 or CARD-4 is a molecule which binds 

CARD-3 or CARD-4, but does not substantially bind other 
molecules in a sample, e.g., a biological sample, which 
naturally contains CARD-3 or CARD-4. Examples of 
immunologically active portions of immunoglobulin 
20 molecules include F(ab) and F(ab')2 fragments which can 
be generated by treating the antibody with an enzyme such 
as pepsin. The invention provides polyclonal and 
monoclonal antibodies that bind CARD-3 or CARD-4. The 
term "monoclonal antibody" or "monoclonal antibody 
25 composition", as used herein, refers to a population of 
antibody molecules that contain only one species of an 
antigen binding site capable of immunoreacting with a 
particular epitope of CARD-3 or CARD-4. A monoclonal 
antibody composition thus typically displays a single 
30 binding affinity for a particular CARD-3 or CARD-4 
protein with which it immunoreacts . 

Polyclonal anti -CARD-3 or CARD-4 antibodies can be 
prepared as described above by immunizing a suitable 
subject with a CARD-3 or CARD-4 immunogen. The 
35 anti-CARD-3 or CARD-4 antibody titer in the immunized 
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subject can be monitored over time by standard 
techniques, such as with an enzyme linked immunosorbent 
assay (ELISA) using immobilized CARD-3 or CARD-4 . If 
desired, the antibody molecules directed against CARD-3 
5 or CARD-4 can be isolated from the mammal (e.g., from the 
blood) and further purified by well-known techniques, 
such as protein A chromatography to obtain the IgG 
fraction. At an appropriate time after immunization, 
e.g., when the anti-CARD-3 or CARD-4 antibody titers are 

10 highest, antibody-producing cells can be obtained from 
the subject and used to prepare monoclonal antibodies by 
standard techniques, such as the hybridoma technique 
originally described by Kohler and Milstein (1975) Nature 
256:495-497, the human B cell hybridoma technique (Kozbor 

15 et al . (1983) Immunol Today 4:72), the EBV-hybridoma 

technique (Cole et al . (1985), Monoclonal Antibodies and 
Cancer Therapy, Alan R. Liss, Inc., pp. 77-96) or trioma 
techniques. The technology for producing various 
antibodies monoclonal antibody hybridomas is well known 

20 (see generally Current Protocols in Immunology (1994) 

Coligan et al . (eds.) John Wiley & Sons, Inc., New York, 
NY) . Briefly, an immortal cell line (typically a 
myeloma) is fused to lymphocytes (typically splenocytes) 
from a mammal immunized with a CARD-3 or CARD-4 immunogen 

2 5 as described above, and the culture supernatant s of the 

resulting hybridoma cells are screened to identify a 
hybridoma producing a monoclonal antibody that binds 
CARD-3 or CARD-4 . 

Any of the many well known protocols used for 

3 0 fusing lymphocytes and immortalized cell lines can be 

applied for the purpose of generating an anti-CARD-3 or 
CARD-4 monoclonal antibody (see, e.g., Current Protocols 
in Immunology, supra; Gal f re et al . (1977) Nature 
266:55052; R.H. Kenneth, in Monoclonal Antibodies: A New 
35 Dimension In Biological Analyses, Plenum Publishing 
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Corp., New York:, New York (1980); and Lerner (1981) Yale 
J. Biol. Med., 54:387-402. Moreover, the ordinarily 
skilled worker will appreciate that there are many 
variations of such methods which also would be useful. 
5 Typically, the immortal cell line (e.g., a myeloma cell 
line) is derived from the same mammalian species as the 
lymphocytes. For example, murine hybridomas can be made 
by fusing lymphocytes from a mouse immunized with an 
immunogenic preparation of the present invention with an 

10 immortalized mouse cell line, e.g., a myeloma cell line 
that is sensitive to culture medium containing 
hypoxanthine , aminopterin and thymidine ("HAT medium"). 
Any of a number of myeloma cell lines can be used as a 
fusion partner according to standard techniques, e.g., 

15 the P3-NSl/l-Ag4-l, P3 -x63 -Ag8 . 653 or Sp2/0-Agl4 myeloma 
lines. These myeloma lines are available from ATCC . 
Typically, HAT-sensit ive mouse myeloma cells are fused to 
mouse splenocytes using polyethylene glycol ("PEG"). 
Hybridoma cells resulting from the fusion are then 

2 0 selected using HAT medium, which kills unfused and 

unproductively fused myeloma cells (unfused splenocytes 
die after several days because they are not transformed) . 
Hybridoma cells producing a monoclonal antibody of the 
invention are detected by screening the hybridoma culture 
25 supernatants for antibodies that bind CARD-3 or CARD-4, 
e.g., using a standard ELISA assay. 

Alternative to preparing monoclonal 
antibody-secreting hybridomas, a monoclonal anti-CARD-3 
or CARD-4 antibody can be identified and isolated by 

3 0 screening a recombinant combinatorial immunoglobulin 

library (e.g., an antibody phage display library) with 
CARD-3 or CARD-4 to thereby isolate immunoglobulin 
library members that bind CARD-3 or CARD-4. Kits for 
generating and screening phage display libraries are 
35 commercially available (e.g., the Pharmacia Recombinant 
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Phage Antibody System, Catalog No. 27-9400-01; and the 
Stratagene Surf ZAP Phage Display Kit, Catalog No. 
240612) . Additionally, examples of methods and reagents 
particularly amenable for use in generating and screening 
5 antibody display library can be found in, for example, 
U.S. Patent No. 5,223,409; PCT Publication No. WO 
92/18619; PCT Publication No. WO 91/17271; PCT 
Publication No. WO 92/20791; PCT Publication No. WO 
92/15679; PCT Publication No. WO 93/01288; PCT 

10 Publication No. WO 92/01047; PCT Publication No. WO 

92/09690; PCT Publication No. WO 90/02809; Fuchs et al . 
(1991) Bio/Technology 9:1370-1372; Hay et al . (1992) Hum. 
Antibod. Hybridomas 3:81-85; Huse et al . (1989) Science 
246:1275-1281; Griffiths et al . (1993) EMBO J 12:725-734. 

15 Additionally, recombinant anti-CARD-3 or CARD-4 

antibodies, such as chimeric and humanized monoclonal 
antibodies, comprising both human and non-human portions, 
which can be made using standard recombinant DNA 
techniques, are within the scope of the invention. Such 

2 0 chimeric and humanized monoclonal antibodies can be 

produced by recombinant DNA techniques known in the art , 
for example using methods described in PCT Publication 
No. WO 87/02671; European Patent Application 184,187; 
European Patent Application 171,496; European Patent 

25 Application 173,494; PCT Publication No. WO 86/01533; 
U.S. Patent No. 4,816,567; European Patent Application 
125,023; Better et al . (1988) Science 240:1041-1043; Liu 
et al. (1987) Proc . Natl. Acad. Sci . USA 84:3439-3443; 
Liu et al . (1987) J. Immunol. 139:3521-3526; Sun et al . 

30 (1987) Proc. Natl. Acad. Sci. USA 84:214-218; Nishimura 
et al. (1987) Cane. Res. 47:999-1005; Wood et al . (1985) 
Nature 314:446-449; and Shaw et al . (1988) J. Natl. 
Cancer Inst. 80:1553-1559); Morrison, (1985) Science 
229:1202-1207; Oi et al . (1986) Bio/Techniques 4:214; 

35 U.S. Patent 5,225,539; Jones et al . (1986) Nature 
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321:552-525; Verhoeyan et al . (1988) Science 239:1534; 
and Beidler et al . (1988) J. Immunol. 141:4053-4060. 

An anti-CARD-3 or CARD-4 antibody (e.g., 
monoclonal antibody) can be used to isolate CARD-3 or 
5 CARD - 4 by standard techniques, such as affinity 

chromatography or immunoprecipitat ion . An anti-CARD-3 or 
CARD-4 antibody can facilitate the purification of 
natural CARD-3 or CARD-4 from cells and of recombinantly 
produced CARD-3 or CARD-4 expressed in host cells. 

10 Moreover, an anti-CARD-3 or CARD-4 antibody can be used 
to detect CARD-3 or CARD-4 protein (e.g., in a cellular 
lysate or cell supernatant) in order to evaluate the 
abundance and pattern of expression of the CARD-3 or 
CARD-4 protein. Anti-CARD-3 or CARD-4 antibodies can be 

15 used diagnostically to monitor protein levels in tissue 
as part of a clinical testing procedure, e.g., to, for 
example, determine the efficacy of a given treatment 
regimen. Detection can be facilitated by coupling the 
antibody to a detectable substance. Examples of 

20 detectable substances include various enzymes, prosthetic 
groups, fluorescent materials, luminescent materials, 
bioluminescent materials, and radioactive materials. 
Examples of suitable enzymes include horseradish 
peroxidase, alkaline phosphatase, £-galactosidase , or 

25 acetylcholinesterase; examples of suitable prosthetic 
group complexes include streptavidin/biotin and 
avidin/biotin; examples of suitable fluorescent materials 
include umbellif erone , fluorescein, fluorescein 
isothiocyanate , rhodamine , dichlorotriazinylamine 

30 fluorescein, dansyl chloride or phycoerythrin; an example 
of a luminescent material includes luminol ; examples of 
bioluminescent materials include lucif erase, luciferin, 
and aequorin, and examples of suitable radioactive 
material include 125 1 , 131 I, 35 S or 3 H. 
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III. Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to 
vectors, preferably expression vectors, containing a 
nucleic acid encoding CARD-3 or CARD-4 (or a portion 
5 thereof) . As used herein, the term "vector" refers to a 
nucleic acid molecule capable of transporting another 
nucleic acid to which it has been linked. One type of 
vector is a "plasmid" , which refers to a circular double 
stranded DNA loop into which additional DNA segments can 

10 be ligated. Another type of vector is a viral vector, 
wherein additional DNA segments can be ligated into the 
viral genome. Certain vectors are capable of autonomous 
replication in a host cell into which they are introduced 
(e.g., bacterial vectors having a bacterial origin of 

15 replication and episomal mammalian vectors) . Other 
vectors (e.g., non-episomal mammalian vectors) are 
integrated into the genome of a host cell upon 
introduction into the host cell, and thereby are 
replicated along with the host genome. Moreover, certain 

2 0 vectors, expression vectors, are capable of directing the 

expression of genes to which they are operatively linked. 
In general, expression vectors of utility in recombinant 
DNA techniques are often in the form of plasmids 
(vectors) . However, the invention is intended to include 
25 such other forms of expression vectors, such as viral 
vectors (e.g., replication defective retroviruses, 
adenoviruses and adeno-associated viruses) , which serve 
equivalent functions . 

The recombinant expression vectors of the 

3 0 invention comprise a nucleic acid of the invention in a 

form suitable for expression of the nucleic acid in a 
host cell, which means that the recombinant expression 
vectors include one or more regulatory sequences, 
selected on the basis of the host cells to be used for 
35 expression, which is operatively linked to the nucleic 
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acid sequence to be expressed. Within a recombinant 
expression vector, "operably linked" is intended to mean 
that the nucleotide sequence of interest is linked to the 
regulatory sequence (s) in a manner which allows for 
5 expression of the nucleotide sequence (e.g., in an in 
vitro transcription/translation system or in a host cell 
when the vector is introduced into the host cell) . The 
term "regulatory sequence" is intended to include 
promoters, enhancers and other expression control 

10 elements (e.g., polyadenylation signals). Such 

regulatory sequences are described, for example, in 
Goeddel; Gene Expression Technology: Methods in 
Enzymology 185, Academic Press, San Diego, CA (1990). 
Regulatory sequences include those which direct 

15 constitutive expression of a nucleotide sequence in many 
types of host cell and those which direct expression of 
the nucleotide sequence only in certain host cells (e.g., 
tissue-specific regulatory sequences) . It will be 
appreciated by those skilled in the art that the design 

20 of the expression vector can depend on such factors as 
the choice of the host cell to be transformed, the level 
of expression of protein desired, etc. The expression 
vectors of the invention can be introduced into host 
cells to thereby produce proteins or peptides, including 

25 fusion proteins or peptides, encoded by nucleic acids as 
described herein (e.g., CARD-3 or CARD-4 proteins, mutant 
forms of CARD-3 or CARD-4, fusion proteins, etc.). 

The recombinant expression vectors of the 
invention can be designed for expression of CARD-3 or 

30 CARD-4 in prokaryotic or eukaryotic cells, e.g., 

bacterial cells such as E. coli, insect cells (using 
baculovirus expression vectors) yeast cells or mammalian 
cells. Suitable host cells are discussed further in 
Goeddel, Gene Expression Technology: Methods in 

35 Enzymology 185, Academic Press, San Diego, CA (1990) . 
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Alternatively, the recombinant expression vector can be 
transcribed and translated in vitro, for example using T7 
promoter regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most 
5 often carried out in E. coli with vectors containing 
constitutive or inducible promoters directing the 
expression of either fusion or non-fusion proteins. 
Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the 

10 recombinant protein. Such fusion vectors typically serve 
three purposes: 1) to increase expression of recombinant 
protein; 2) to increase the solubility of the recombinant 
protein; and 3) to aid in the purification of the 
recombinant protein by acting as a ligand in affinity 

15 purification. Often, in fusion expression vectors, a 
proteolytic cleavage site is introduced at the junction 
of the fusion moiety and the recombinant protein to 
enable separation of the recombinant protein from the 
fusion moiety subsequent to purification of the fusion 

2 0 protein. Such enzymes, and their cognate recognition 

sequences, include Factor Xa, thrombin and enterokinase . 
Typical fusion expression vectors include pGEX (Pharmacia 
Biotech Inc; Smith and Johnson (1988) Gene 67:31-40), 
pMAL (New England Biolabs, Beverly, MA) and pRIT5 

2 5 (Pharmacia, Piscataway, NJ) which fuse glutathione 
S-transf erase (GST) , maltose E binding protein, or 
protein A, respectively, to the target recombinant 
protein . 

Examples of suitable inducible non- fusion E. coli 
30 expression vectors include pTrc (Amann et al . , (1988) 
Gene 69:301-315) and pET lid (Studier et al . , Gene 
Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, California (1990) 60-89) . 
Target gene expression from the pTrc vector relies on 
35 host RNA polymerase transcription from a hybrid trp-lac 
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fusion promoter. Target gene expression from the pET lid 
vector relies on transcription from a T7 gnlO-lac fusion 
promoter mediated by a coexpressed viral RNA polymerase 
(T7 gnl) . This viral polymerase is supplied by host 
5 strains BL21(DE3) or HMS174(DE3) from a resident ? 
prophage harboring a T7 gnl gene under the 
transcriptional control of the lacUVS promoter. 

One strategy to maximize recombinant protein 
expression in E. coli is to express the protein in a host 

10 bacteria with an impaired capacity to proteolyt ically 
cleave the recombinant protein (Gottesman, Gene 
Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, California (1990) 119-128) . 
Another strategy is to alter the nucleic acid sequence of 

15 the nucleic acid to be inserted into an expression vector 
so that the individual codons for each amino acid are 
those preferentially utilized in E. coli (Wada et al . 
(1992) Nucleic Acids Res. 20:2111-2118). Such alteration 
of nucleic acid sequences of the invention can be carried 

20 out by standard DNA synthesis techniques. 

In another embodiment, the CARD-3 or CARD-4 
expression vector is a yeast expression vector. Examples 
of vectors for expression in yeast S. cerivisae include 
pYepSecl (Baldari et al . (1987) EMBO J. 6:229-234), pMFa 

25 (Kurjan and Herskowitz, (1982) Cell 30:933-943), pJRY88 
(Schultz et al . (1987) Gene 54:113-123), pYES2 
(Invitrogen Corporation, San Diego, CA) , pGBT9 (Clontech, 
Palo Alto, CA) , pGADIO (Clontech, Palo Alto, CA) , pYADE4 
and pYGAE2 and pYPGE2 (Brunelli and Pall, (1993) Yeast 

30 9:1299-1308), pYPGE15 (Brunelli and Pall, (1993) Yeast 

9:1309-1318), pACTII (Dr. S.E. Elledge, Baylor College of 
Medicine) , and picZ (InVitrogen Corp, San Diego, CA) . 
For example, in Example 7 the expression of a fusion 
protein comprising amino acids 1-145 of human CARD-4L 

3 5 fused to the DNA-binding domain of S. cerevisiae 
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transcription factor GAL4 from the yeast expression 
vector pGBT9 is described. In another example, Example 8 
describes the expression of a fusion protein comprising 
amino acids 4 06-953 of human CARD-4L fused to the 
5 DNA-binding domain of S. cerevisiae transcription factor 
GAL4 from the yeast expression vector pGBT9 . In yet 
another example, Example 7 describes the expression of a 
fusion protein comprising CARD-3 fused to the 
transcriptional activation domain of S. cerevisiae 

10 transcription factor GAL4 from the yeast expression 
vector pACTII . 

Alternatively, CARD-3 or CARD-4 can be expressed 
in insect cells using baculovirus expression vectors. 
Baculovirus vectors available for expression of proteins 

15 in cultured insect cells (e.g., Sf 9 cells) include the 
pAc series (Smith et al . (1983) Mol . Cell Biol. 
3:2156-2165) and the pVL series (Lucklow and Summers 
(1989) Virology 170:31-39). 

In yet another embodiment, a nucleic acid of the 

20 invention is expressed in mammalian cells using a 
mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed (1987) Nature 
329:840), pCI (Promega) , and pMT2PC (Kaufman et al . 
(1987) EMBO J. 6:187-195). When used in mammalian cells, 

25 the expression vector's control functions are often 
provided by viral regulatory elements. For example, 
commonly used promoters are derived from polyoma, 
Adenovirus 2, cytomegalovirus and Simian Virus 40. For 
other suitable expression systems for both prokaryotic 

3 0 and eukaryotic cells see chapters 16 and 17 of Sambrook 
et al. (supra). For example, Example 9, Example 10, and 
Example 12 describe the expression of human CARD-4 or 
fragments therof, CARD-3, or both from the mammalian 
expression vector pCI . 
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| In another embodiment, the recombinant mammalian 

expression vector is capable of directing expression of 
the nucleic acid preferentially in a particular cell type 
(e.g., tissue-specific regulatory elements are used to 
5 express the nucleic acid) . Tissue-specific regulatory 
elements are known in the art. Non-limiting examples of 
' suitable tissue- specif ic promoters include the albumin 

promoter ( liver- specif ic ; Pinkert et al . (1987) Genes 
Dev. 1:268-277), lymphoid- speci fic promoters (Calame and 
10 Eaton (1988) Adv. Immunol. 43:235-275), in particular 
promoters of T cell receptors (Winoto and Baltimore 

(1989) EMBO J. 8:729-733) and immunoglobulins (Banerji et 
al. (1983) Cell 33:729-740; Queen and Baltimore (1983) 
Cell 33:741-748), neuron- specif ic promoters (e.g., the 

15 neurofilament promoter; Byrne and Ruddle (1989) Proc. 
Natl. Acad. Sci . USA 86:5473-5477), pancreas-specific 
promoters (Edlund et al . (1985) Science 230:912-916), and 
mammary gland- specif ic promoters (e.g., milk whey 
promoter; U.S. Patent No. 4,8 73,316 and European 

20 Application Publication No. 264,166). 

Developmentally-regulated promoters are also encompassed, 
for example the murine hox promoters (Kessel and Gruss 

(1990) Science 249:374-379) and the a- fetoprotein 
promoter (Campes and Tilghman (1989) Genes Dev. 

25 3 :537-546) . 

The invention further provides a recombinant 
expression vector comprising a DNA molecule of the 
invention cloned into the expression vector in an 
antisense orientation. That is, the DNA molecule is 
, 3 0 operatively linked to a regulatory sequence in a manner 

which allows for expression (by transcription of the DNA 
molecule) of an RNA molecule which is antisense to CARD-3 

i 

or CARD-4 mRNA. Regulatory sequences operatively linked 
j to a nucleic acid cloned in the antisense orientation can 

3 5 be chosen which direct the continuous expression of the 
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antisense RNA molecule in a variety of cell types, for 
instance viral promoters and/or enhancers, or regulatory 
sequences can be chosen which direct constitutive, tissue 
specific or cell type specific expression of antisense 
5 RNA. The antisense expression vector can be in the form 
of a recombinant plasmid, phagemid or attenuated virus in 
which antisense nucleic acids are produced under the 
control of a high efficiency regulatory region, the 
activity of which can be determined by the cell type into 
10 which the vector is introduced. For a discussion of the 
regulation of gene expression using antisense genes see 
Weintraub et al . (Reviews - Trends in Genetics, Vol. 1(1) 
1986) . 

Another aspect of the invention pertains to host 
15 cells into which a recombinant expression vector of the 
invention or isolated nucleic acid molecule of the 
invention has been introduced. The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. 
It is understood that such terms refer not only to the 
20 particular subject cell but to the progeny or potential 
progeny of such a cell. Because certain modifications 
may occur in succeeding generations due to either 
mutation or environmental influences, such progeny may 
not, in fact, be identical to the parent cell, but are 

2 5 still included within the scope of the term as used 

herein . 

A host cell can be any prokaryotic or eukaryotic 
cell. For example, CARD- 3 or CARD -4 protein can be 
expressed in bacterial cells such as E. coli, insect 

3 0 cells, yeast or mammalian cells (such as Chinese hamster 

ovary cells (CHO) or COS cells) . Other suitable host 
cells are known to those skilled in the art. For 
example, in Example 7 a Saccharomyces cerevisiae host 
cell for recombinant CARD-4 and CARD-3 expression is 
35 described and in Examples 9, 10, and 12 293T host cells 
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for expression of CARD4 or fragments thereof or CARD- 3 
are described. 

Vector DNA or an isolated nucleic acid molecule of 
the invention can be introduced into prokaryotic or 
eukaryotic cells via conventional transformation or 
transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer 
to a variety of art - recognized techniques for introducing 
foreign nucleic acid (e.g., DNA) into a host cell, 
including calcium phosphate or calcium chloride 
co -precipitation, DEAE-dextran-mediated transfection, 
lipofection, or electroporation . Suitable methods for 
transforming or transfecting host cells can be found in 
Sambrook, et al . (supra), and other laboratory manuals. 

For stable transfection of mammalian cells, it is 
known that, depending upon the expression vector and 
transfection technique used, only a small fraction of 
cells may integrate the foreign DNA into their genome. 
IN some cases vector DNA is retained by the host cell. 
In other cases the host cell does not retain vector DNA 
and retains only an isolated nucleic acid molecule of the 
invention carried by the vector. In some cases, and 
isolated nucleic acid molecule of the invention is used 
to transform a cell without the use of a vector. 

In order to identify and select these integrants, 
a gene that encodes a selectable marker (e.g., resistance 
to antibiotics) is generally introduced into the host 
cells along with the gene of interest. Preferred 
selectable markers include those which confer resistance 
to drugs, such as G418, hygromycin and methotrexate. 
Nucleic acid encoding a selectable marker can be 
introduced into a host cell on the same vector as that 
encoding CARD- 3 or CARD- 4 or can be introduced on a 
separate vector. Cells stably transfected with the 
introduced nucleic acid can be identified by drug 
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selection (e.g., cells that have incorporated the 
selectable marker gene will survive, while the other 
cells die) . 

A host cell of the invention, such as a 
5 prokaryotic or eukaryotic host cell in culture, can be 
used to produce a (i.e., express) CARD-3 or CARD-4 
protein. Accordingly, the invention further provides 
methods for producing CARD-3 or CARD-4 protein using the 
host cells of the invention. In one embodiment, the 

10 method comprises culturing the host cell of the invention 
(into which a recombinant expression vector or isolated 
nucleic acid molecule encoding CARD-3 or CARD-4 has been 
introduced) in a suitable medium such that CARD-3 or 
CARD-4 protein is produced. In another embodiment, the 

15 method further comprises isolating CARD-3 or CARD-4 from 
the medium or the host cell. 

The host cells of the invention can also be used 
to produce nonhuman transgenic animals. For example, in 
one embodiment, a host cell of the invention is a 

20 fertilized oocyte or an embryonic stem cell into which 
CARD-3 or CARD-4 -coding sequences have been introduced. 
Such host cells can then be used to create non-human 
transgenic animals in which exogenous CARD-3 or CARD-4 
sequences have been introduced into their genome or 

2 5 homologous recombinant animals in which endogenous CARD-3 
or CARD-4 sequences have been altered. Such animals are 
useful for studying the function and/or activity of 
CARD-3 or CARD-4 and for identifying and/or evaluating 
modulators of CARD-3 or CARD-4 activity. As used herein, 

30 a "transgenic animal" is a non-human animal, preferably a 
mammal, more preferably a rodent such as a rat or mouse, 
in which one or more of the cells of the animal includes 
a transgene. Other examples of transgenic animals 
include non-human primates, sheep, dogs, cows, goats, 

35 chickens, amphibians, etc. A transgene is exogenous DNA 
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which is integrated into the genome of a cell from which 
a transgenic animal develops and which remains in the 
genome of the mature animal, thereby directing the 
expression of an encoded gene product in one or more cell 
5 types or tissues of the transgenic animal. As used 

herein, an "homologous recombinant animal 11 is a non-human 
animal, preferably a mammal, more preferably a mouse, in 
which an endogenous CARD-3 or CARD-4 gene has been 
altered by homologous recombination between the 

10 endogenous gene and an exogenous DNA molecule introduced 
into a cell of the animal, e.g., an embryonic cell of the 
animal, prior to development of the animal. 

A transgenic animal of the invention can be 
created by introducing CARD- 3 or CARD-4 -encoding nucleic 

15 acid into the male pronuclei of a fertilized oocyte, 
e.g., by microinjection, retroviral infection, and 
allowing the oocyte to develop in a pseudopregnant female 
foster animal. The CARD-3 or CARD-4 cDNA sequence e.g., 
that of (SEQ ID NO : 1 , SEQ ID NO : 3 , SEQ ID NO : 7 , SEQ ID 

2 0 NO: 9, SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 38, SEQ ID 

NO: 40, SEQ ID NO: 42 or the cDNA of ATCC , or the 

cDNA of ATCC , or the cDNA of ATCC ) can be 

introduced as a transgene into the genome of a non-human 
animal. Alternatively, a nonhuman homolog or ortholog of 

2 5 the human CARD-3 or CARD-4 gene, such as a mouse CARD-3 

or CARD-4 gene, can be isolated based on hybridization to 
the human CARD-3 or CARD-4 cDNA and used as a transgene. 
For example, the mouse ortholog of CARD-4, Figure 15 and 
SEQ ID NO: 42 can be used to make a transgenic animal 
30 using standard methods. Intronic sequences and 

polyadenylation signals can also be included in the 
transgene to increase the efficiency of expression of the 
transgene. A tissue-specific regulatory sequence (s) can 
be operably linked to the CARD-3 or CARD-4 transgene to 

3 5 direct expression of CARD-3 or CARD-4 protein to 
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particular cells. Methods for generating transgenic 
animals via embryo manipulation and microinjection, 
particularly animals such as mice, have become 
conventional in the art and are described, for example, 
5 in U.S. Patent Nos . 4,736,866 and 4,870,009, U.S. Patent 
No. 4,873,191 and in Hogan, Manipulating the Mouse 
Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, N.Y., 1986) . Similar methods are used for 
production of other transgenic animals. A transgenic 

10 founder animal can be identified based upon the presence 
of the CARD-3 or CARD-4 transgene in its genome and/or 
expression of CARD-3 or CARD-4 mRNA in tissues or cells 
of the animals. A transgenic founder animal can then be 
used to breed additional animals carrying the transgene. 

15 Moreover, transgenic animals carrying a transgene 

encoding CARD-3 or CARD-4 can further be bred to other 
transgenic animals carrying other transgenes. 

To create an homologous recombinant animal, a 
vector is prepared which contains at least a portion of a 

20 CARD-3 or CARD-4 gene (e.g., a human or a non-human 
homolog of the CARD-3 or CARD-4 gene, e.g., a murine 
CARD-3 or CARD-4 gene) into which a deletion, addition or 
substitution has been introduced to thereby alter, e.g., 
functionally disrupt, the CARD-3 or CARD-4 gene. In an 

25 embodiment, the vector is designed such that, upon 

homologous recombination, the endogenous CARD-3 or CARD-4 
gene is functionally disrupted (i.e., no longer encodes a 
functional protein; also referred to as a "knock out" 
vector) . Alternatively, the vector can be designed such 

30 that, upon homologous recombination, the endogenous 

CARD-3 or CARD-4 gene is mutated or otherwise altered but 
still encodes functional protein (e.g., the upstream 
regulatory region can be altered to thereby alter the 
expression of the endogenous CARD-3 or CARD-4 protein) . 

35 In the homologous recombination vector, the altered 
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portion of the CARD- 3 or CARD- 4 gene is flanked at its 5' 
and 3' ends by additional nucleic acid of the CARD-3 or 
CARD-4 gene to allow for homologous recombination to 
occur between the exogenous CARD-3 or CARD-4 gene carried 
5 by the vector and an endogenous CARD - 3 or CARD-4 gene in 
an embryonic stem cell. The additional flanking CARD-3 
or CARD-4 nucleic acid is of sufficient length for 
successful homologous recombination with the endogenous 
gene. Typically, several kilobases of flanking DNA (both 

10 at the 5' and 3' ends) are included in the vector (see, 
e.g., Thomas and Capecchi (1987) Cell 51:503 for a 
description of homologous recombination vectors) . The 
vector is introduced into an embryonic stem cell line 
(e.g., by electroporat ion) and cells in which the 

15 introduced CARD-3 or CARD-4 gene has homologously 

recombined with the endogenous CARD-3 or CARD-4 gene are 
selected (see, e.g., Li et al . (1992) Cell 69:915). The 
selected cells are then injected into a blastocyst of an 
animal (e.g., a mouse) to form aggregation chimeras (see, 

20 e.g., Bradley in Teratocarcinomas and Embryonic Stem 

Cells: A Practical Approach, Robertson, ed. (IRL, Oxford, 
1987) pp. 113-152) . A chimeric embryo can then be 
implanted into a suitable pseudopregnant female foster 
animal and the embryo brought to term. Progeny harboring 

25 the homologously recombined DNA in their germ cells can 
be used to breed animals in which all cells of the animal 
contain the homologously recombined DNA by germline 
transmission of the transgene. Methods for constructing 
homologous recombination vectors and homologous 

3 0 recombinant animals are described further in Bradley 

(1991) Current Opinion in Bio/Technology 2:823-829 and in 
PCT Publication Nos . WO 90/11354, WO 91/01140, WO 
92/0968, and WO 93/04169. 

In another embodiment, transgenic non-humans 

3 5 animals can be produced which contain selected systems 
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which allow for regulated expression of the transgene. 
One example of such a system is the cre/loxP recombinase 
system of bacteriophage Pi. For a description of the 
cre/loxP recombinase system, see, e.g., Lakso et al . 
5 (1992) Proc. Natl. Acad. Sci . USA 89:6232-623 6. Another 
example of a recombinase system is the FLP recombinase 
system of Saccharomyces cerevisiae (O' Gorman et al . 
(1991) Science 251:1351-1355. If a cre/loxP recombinase 
system is used to regulate expression of the transgene, 

10 animals containing transgenes encoding both the Cre 

recombinase and a selected protein are required. Such 
animals can be provided through the construction of 
"double" transgenic animals, e.g., by mating two 
transgenic animals, one containing a transgene encoding a 

15 selected protein and the other containing a transgene 
encoding a recombinase. 

Clones of the non-human transgenic animals 
described herein can also be produced according to the 
methods described in Wilmut et al . (1997) Nature 

20 385:810-813 and PCT Publication Nos . WO 97/07668 and WO 
97/07669. In brief, a cell, e.g., a somatic cell, from 
the transgenic animal can be isolated and induced to exit 
the growth cycle and enter Go phase. The quiescent cell 
can then be fused, e.g., through the use of electrical 

2 5 pulses, to an enucleated oocyte from an animal of the 

same species from which the quiescent cell is isolated. 
The reconstructed oocyte is then cultured such that it 
develops to morula or blastocyte and then transferred to 
pseudopregnant female foster animal. The offspring borne 

3 0 of this female foster animal will be a clone of the 

animal from which the cell, e.g., the somatic cell, is 
isolated. 
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IV. Pharmaceutical Compositions 

The CARD-3 or CARD-4 nucleic acid molecules, 
CARD -3 or CARD-4 proteins, and ant i- CARD- 3 or CARD-4 
antibodies (also referred to herein as "active 
5 compounds") of the invention can be incorporated into 
pharmaceutical compositions suitable for administration. 
Such compositions typically comprise the nucleic acid 
molecule, protein, or antibody and a pharmaceutical^ 
acceptable carrier. As used herein the language 

10 "pharmaceutical^ acceptable carrier" is intended to 

include any and all solvents, dispersion media, coatings, 
antibacterial and antifungal agents, isotonic and 
absorption delaying agents, and the like, compatible with 
pharmaceutical administration. The use of such media and 

15 agents for pharmaceut ically active substances is well 
known in the art. Except insofar as any conventional 
media or agent is incompatible with the active compound, 
use thereof in the compositions is contemplated. 
Supplementary active compounds can also be incorporated 

20 into the compositions. 

A pharmaceutical composition of the invention is 
formulated to be compatible with its intended route of 
administration. Examples of routes of administration 
include parenteral, e.g., intravenous, intradermal, 

25 subcutaneous, oral (e.g., inhalation), transdermal 
(topical) , transmucosal, and rectal administration. 
Solutions or suspensions used for parenteral, 
intradermal, or subcutaneous application can include the 
following components: a sterile diluent such as water for 

30 injection, saline solution, fixed oils, polyethylene 

glycols, glycerine, propylene glycol or other synthetic 
solvents; antibacterial agents such as benzyl alcohol or 
methyl parabens ; antioxidants such as ascorbic acid or 
sodium bisulfite; chelating agents such as 

35 ethylenediaminetetraacetic acid; buffers such as 
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acetates, citrates or phosphates and agents for the 
adjustment of tonicity such as sodium chloride or 
dextrose. pH can be adjusted with acids or bases, such 
as hydrochloric acid or sodium hydroxide. The parenteral 
5 preparation can be enclosed in ampoules, disposable 

syringes or multiple dose vials made of glass or plastic. 

Pharmaceutical compositions suitable for 
injectable use include sterile aqueous solutions (where 
water soluble) or dispersions and sterile powders for the 

10 extemporaneous preparation of sterile injectable 

solutions or dispersion. For intravenous administration, 
suitable carriers include physiological saline, 
bacteriostatic water, Cremophor EL? (BASF; Parsippany, 
NJ) or phosphate buffered saline (PBS) . In all cases, 

15 the composition must be sterile and should be fluid to 
the extent that easy syringability exists. It must be 
stable under the conditions of manufacture and storage 
and must be preserved against the contaminating action of 
microorganisms such as bacteria and fungi. The carrier 

2 0 can be a solvent or dispersion medium containing, for 
example, water, ethanol , polyol (for example, glycerol, 
propylene glycol, and liquid polyetheylene glycol, and 
the like), and suitable mixtures thereof. The proper 
fluidity can be maintained, for example, by the use of a 

25 coating such as lecithin, by the maintenance of the 

required particle size in the case of dispersion and by 
the use of surfactants. Prevention of the action of 
microorganisms can be achieved by various antibacterial 
and antifungal agents, for example, parabens, 

30 chlorobutanol , phenol, ascorbic acid, thimerosal , and the 
like. In many cases, it will be preferable to include 
isotonic agents, for example, sugars, polyalcohols such 
as mannitol, sorbitol, sodium chloride in the 
composition. Prolonged absorption of the injectable 
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compositions can be brought about by including in the 
composition an agent which delays absorption, for 
example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by 
5 incorporating the active compound (e.g., a CARD-3 or 

CARD-4 protein or anti-CARD-3 or CARD-4 antibody) in the 
required amount in an appropriate solvent with one or a 
combination of ingredients enumerated above, as required, 
followed by filtered sterilization. Generally, 

10 dispersions are prepared by incorporating the active 

compound into a sterile vehicle which contains a basic 
dispersion medium and the required other ingredients from 
those enumerated above. In the case of sterile powders 
for the preparation of sterile injectable solutions, the 

15 preferred methods of preparation are vacuum drying and 
f reeze-drying which yields a powder of the active 
ingredient plus any additional desired ingredient from a 
previously sterile-filtered solution thereof. 

Oral compositions generally include an inert 

2 0 diluent or an edible carrier. They can be enclosed in 
gelatin capsules or compressed into tablets. For the 
purpose of oral therapeutic administration, the active 
compound can be incorporated with excipients and used in 
the form of tablets, troches, or capsules. Oral 

25 compositions can also be prepared using a fluid carrier 
for use as a mouthwash, wherein the compound in the fluid 
carrier is applied orally and swished and expectorated or 
swallowed. Pharmaceut ically compatible binding agents, 
and/or adjuvant materials can be included as part of the 

30 composition. The tablets, pills, capsules, troches and 
the like can contain any of the following ingredients, or 
compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an 
excipient such as starch or lactose, a disintegrating 

35 agent such as alginic acid, Primogel, or corn starch; a 
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lubricant such as magnesium stearate or Sterotes; a 
glidant such as colloidal silicon dioxide; a sweetening 
agent such as sucrose or saccharin; or a flavoring agent 
such as peppermint, methyl salicylate, or orange 
5 flavoring. For administration by inhalation, the 

compounds are delivered in the form of an aerosol spray 
from pressured container or dispenser which contains a 
suitable propellant, e.g., a gas such as carbon dioxide, 
or a nebulizer. 

10 Systemic administration can also be by 

transmucosal or transdermal means. For transmucosal or 
transdermal administration, penetrants appropriate to the 
barrier to be permeated are used in the f ormulation. 
Such penetrants are generally known in the art, and 

15 include, for example, for transmucosal administration, 
detergents, bile salts, and fusidic acid derivatives. 
Transmucosal administration can be accomplished through 
the use of nasal sprays or suppositories. For 
transdermal administration, the active compounds are 

2 0 formulated into ointments, salves, gels, or creams as 

generally known in the art. 

The compounds can also be prepared in the form of 
suppositories (e.g., with conventional suppository bases 
such as cocoa butter and other glycerides) or retention 
25 enemas for rectal delivery. 

In one embodiment, the active compounds are 
prepared with carriers that will protect the compound 
against rapid elimination from the body, such as a 
controlled release formulation, including implants and 

3 0 microencapsulated delivery systems. Biodegradable, 

biocompatible polymers can be used, such as ethylene 
vinyl acetate, polyanhydrides , polyglycolic acid, 
collagen, polyorthoesters , and polylactic acid. Methods 
for preparation of such formulations will be apparent to 
35 those skilled in the art. The materials can also be 
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obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including 
liposomes targeted to infected cells with monoclonal 
antibodies to viral antigens) can also be used as 
5 pharmaceutical^ acceptable carriers. These can be 

prepared according to methods known to those skilled in 
the art, for example, as described in U.S. Patent No. 
4 , 522 , 811 . 

It is especially advantageous to formulate oral or 
10 parenteral compositions in dosage unit form for ease of 
administration and uniformity of dosage. Dosage unit 
form as used herein refers to physically discrete units 
suited as unitary dosages for the subject to be treated; 
each unit containing a predetermined quantity of active 
15 compound calculated to produce the desired therapeutic 
effect in association with the required pharmaceutical 
carrier. The specification for the dosage unit forms of 
the invention are dictated by and directly dependent on 
the unique characteristics of the active compound and the 
2 0 particular therapeutic effect to be achieved, and the 
limitations inherent in the art of compounding such an 
active compound for the treatment of individuals. 

The nucleic acid molecules of the invention can be 
inserted into vectors and used as gene therapy vectors . 

2 5 Gene therapy vectors can be delivered to a subject by, 

for example, intravenous injection, local administration 
(U.S. Patent 5,328,470) or by stereotactic injection 
(see, e.g., Chen et al . (1994) Proc . Natl. Acad. Sci . USA 
91:3054-3057). The pharmaceutical preparation of the 

3 0 gene therapy vector can include the gene therapy vector 

in an acceptable diluent, or can comprise a slow release 
matrix in which the gene delivery vehicle is imbedded. 
Alternatively, where the complete gene delivery vector 
can be produced intact from recombinant cells, e.g. 
35 retroviral vectors, the pharmaceutical preparation can 
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include one or more cells which produce the gene delivery 
system. 

The pharmaceutical compositions can be included in 
a container, pack, or dispenser together with 
5 instructions for administration. 

V. Uses and Methods of the Invention 

The nucleic acid molecules, proteins, protein 
homologues, and antibodies described herein can be used 
in one or more of the following methods: a) screening 

10 assays; b) detection assays (e.g., chromosomal mapping, 
tissue typing, forensic biology) , c) predictive medicine 
(e.g., diagnostic assays, prognostic assays, monitoring 
clinical trials, and pharmacogenomics) ; and d) methods of 
treatment (e.g., therapeutic and prophylactic). A CARD-3 

15 or CARD-4 protein interacts with other cellular proteins 
and can thus be used for (i) regulation of cellular 
proliferation; (ii) regulation of cellular 
differentiation; and (iii) regulation of cell survival. 
The isolated nucleic acid molecules of the invention can 

20 be used to express CARD-3 or CARD-4 protein (e.g., via a 
recombinant expression vector in a host cell in gene 
therapy applications) , to detect CARD-3 or CARD-4 mRNA 
(e.g., in a biological sample) or a genetic lesion in a 
CARD-3 or CARD-4 gene, and to modulate CARD-3 or CARD-4 

25 activity. In addition, the CARD-3 or CARD-4 proteins can 
be used to screen drugs or compounds which modulate the 
CARD-3 or CARD-4 activity or expression as well as to 
treat disorders characterized by insufficient or 
excessive production of CARD-3 or CARD-4 protein or 

3 0 production of CARD-3 or CARD-4 protein forms which have 
decreased or aberrant activity compared to CARD-3 or 
CARD-4 wild type protein. In addition, the anti-CARD-3 
or CARD-4 antibodies of the invention can be used to 
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detect and isolate CARD- 3 or CARD- 4 proteins and modulate 
CARD -3 or CARD - 4 activity. 

This invention further pertains to novel agents 
identified by the above-described screening assays and 
5 uses thereof for treatments as described herein. 

A. Screening Assays 

The invention provides a method (also referred to 
herein as a "screening assay") for identifying 
modulators, i.e., candidate or test compounds or agents 

10 (e.g., peptides, pept idomimetics , small molecules or 

other drugs) which bind to CARD-3 or CARD-4 proteins or 
biologically active portions thereof or have a 
stimulatory or inhibitory effect on, for example, CARD-3 
or CARD-4 expression or CARD-3 or CARD-4 activity. An 

15 example of a biologically active portion of human CARD-4 
is amino acids 1-14 5 encoding the CARD domain which is 
sufficient to exhibit CARD-3 -binding activity as 
described in Example 7. Amino acids 406-953 of human 
CARD4L comprising the LRR domain represent a biologically 

20 active portion of CARD - 4 L because they possess 

hNUDC-binding activity as described in Example 8. 

In one embodiment, the invention provides assays 
for screening candidate or test compounds which bind to 
or modulate the activity of a CARD-3 or CARD-4 proteins 

25 or polypeptides or biologically active portions thereof. 
The test compounds of the present invention can be 
obtained using any of the numerous approaches in 
combinatorial library methods known in the art, 
including: biological libraries; spatially addressable 

30 parallel solid phase or solution phase libraries ; 

synthetic library methods requiring deconvolution; the 
"one-bead one-compound" library method; and synthetic 
library methods using affinity chromatography selection. 
The biological library approach is limited to peptide 
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libraries, while the other four approaches are applicable 
to peptide, non-peptide oligomer or small molecule 
libraries of compounds (Lam (1997) Anticancer Drug Des . 
12:145) . Examples of methods for the synthesis of 
5 molecular libraries can be found in the art, for example 
in: DeWitt et al . (1993) Proc. Natl. Acad. Sci. U.S.A. 
90:6909; Erb et al . (1994) Proc. Natl. Acad. Sci. USA 
91:11422; Zuckermann et al . (1994). J. Med. Chem. 
37:2678; Cho et al . (1993) Science 261:1303; Carrell et 

10 al. (1994) Angew. Chem. Int. Ed. Engl. 33:2059; Carell et 
al . (1994) Angew. Chem. Int. Ed. Engl. 33:2061; and 
Gallop et al. (1994) J. Med. Chem. 37:1233. 

Libraries of compounds may be presented in 
solution (e.g., Houghten (1992) Bio/Techniques 

15 13:412-421), or on beads (Lam (1991) Nature 354:82-84), 
chips (Fodor (1993) Nature 364:555-556), bacteria (U.S. 
Patent No. 5,223,409), spores (Patent Nos . 5,571,698; 
5,403,484; and 5,223,409), plasmids (Cull et al . (1992) 
Proc. Natl. Acad. Sci. USA 89:1865-1869) or on phage 

20 (Scott and Smith (1990) Science 249:386-390; Devlin 

(1990) Science 249:404-406; Cwirla et al . (1990) Proc. 
Natl. Acad. Sci. 87:6378-6382; and Felici (1991) J. Mol . 
Biol . 222 :301-310) . 

Determining the ability of the test compound to 

25 modulate the activity of CARD- 3 or CARD-4 or a 

biologically active portion thereof can be accomplished, 
for example, by determining the ability of the CARD-3 or 
CARD-4 protein to bind to or interact with a CARD-3 or 
CARD-4 target molecule. As used herein, a "target 

30 molecule" is a molecule with which a CARD-3 or CARD-4 
protein binds or interacts in nature, for example, a 
molecule associated with the internal surface of a cell 
membrane or a cytoplasmic molecule. A CARD-3 or CARD-4 
target molecule can be a non-CARD-3 or CARD-4 molecule or 

35 a CARD-3 or CARD-4 protein or polypeptide of the present 
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invention. In one embodiment, a CARD- 3 or CARD- 4 target 
molecule is a component of an apoptotic signal 
transduction pathway, e.g., CARD-3 and CARD-4. The 
target, for example, can be a second intracellular 
5 protein which has catalytic activity or a protein which 
facilitates the association of downstream signaling 
molecules with CARD -3 or CARD-4 . In another embodiment, 
CARD-3 or CARD-4 target molecules include CARD-3 because 
CARD - 3 was found to bind to CARD-4 (Examples 7 and 12) 
10 and hNUDC because hNUDC was found to bind to CARD-4 
(Example 8) . 

Determining the ability of the test compound to 
modulate the activity of CARD-3 or CARD-4 or a 
biologically active portion thereof can be accomplished, 

15 for example, by determining the ability of the CARD-3 or 
CARD-4 protein to bind to or interact with any of the 
specific proteins listed in the previous paragraph as 
CARD-3 or CARD-4 target molecules. In another 
embodiment, CARD-3 or CARD-4 target molecules include all 

20 proteins that bind to a CARD-3 or CARD-4 protein or 

fragment thereof in a two-hybrid system binding assay 
which can be used without undue experimentation to 
isolate such proteins from cDNA or genomic two-hybrid 
system libraries. For example, Example 7 describes the 

25 use of the CARD-4 CARD domain region to identify CARD-3 
in a two-hybrid screen and Example 8 describes the use of 
the CARD-4 LRR region to identify hNUDC in a two-hybrid 
screen. The binding assays described in this section 
could be cell -based or cell free (described 

3 0 subsequently) . 

Determining the ability of the CARD-3 or CARD-4 
protein to bind to or interact with a CARD-3 or CARD-4 
target molecule can be accomplished by one of the methods 
described above for determining direct binding. In an 

35 embodiment, determining the ability of the CARD-3 or 
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CARD-4 protein to bind to or interact with a CARD-3 or 
CARD-4 target molecule can be accomplished by determining 
the activity of the target molecule. For example, the 
activity of the target molecule can be determined by 
5 detecting induction of a cellular second messenger of the 
target (e.g., intracellular Ca2 + , diacylglycerol , IP3, 
etc.), detecting catalytic/enzymatic activity of the 
target on an appropriate substrate, detecting the 
induction of a reporter gene (e.g., a CARD-3 or 

10 CARD-4 -responsive regulatory element operatively linked 
to a nucleic acid encoding a detectable marker, e.g. 
lucif erase) , or detecting a cellular response, for 
example, cell survival, cellular differentiation, or cell 
proliferation. For example, in Example 12 CARD-4 is 

15 shov/n to bind to CARD-3 and in Example 10, by monitoring 
a cellular response, CARD-4 is shown to enhance caspase 9 
activity, cell death or apoptosis. Because CARD-3 and 
CARD-4 enhance caspase 9 activity, CARD-3 or CARD-4 
activity can be monitored by assaying the caspase 

20 9-mediated apoptosis cellular response or caspase 9 
enzymatic activity. In addition, and in another 
embodiment, genes induced by CARD-3 or CARD-4 expression 
could be identified by expressing CARD-3 or CARD-4 in a 
cell line and conducting a transcriptional profiling 

2 5 experiment wherein the mRNA expression patterns of the 

cell line transformed with an empty expression vector and 
the cell line transformed with a CARD-3 or CARD-4 
expression vector are compared. The promoters of genes 
induced by CARD-3 or CARD-4 expression could be 

3 0 operatively linked to reporter genes suitable for 

screening such as lucif erase , secreted alkaline 
phosphatase, or beta-galactosidase and the resulting 
constructs could be introduced into appropriate 
expression vectors. A recombinant cell line containing 
35 CARD-3 or CARD-4 and transfected with an expression 
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vector containing a CARD- 3 or CARD -4 responsive promoter 
operatively linked to a reporter gene could be used to 
identify test compounds that modulate CARD-3 or CARD-4 
activity by assaying the expression of the reporter gene 
5 in response to contacting the recombinant cell line with 
test compounds. CARD-3 or CARD-4 agonists can be 
identified as increasing the expression of the reporter 
gene and CARD-3 or CARD-4 antagonists can be identified 
as decreasing the expression of the reporter gene. 

10 In another embodiment of the invention, the 

ability of a test compound to modulate the activity of 
CARD-3, CARD-4, or biologically active portions thereof 
can be determined by assaying the ability of the test 
compound to modulate CARD-3 or CARD-4 -dependent pathways 

15 or processes where the CARD-3 or CARD-4 target proteins 
that mediate the CARD-3 or CARD-4 effect are known or 
unknown. Potential CARD-3 or CARD-4 -dependent pathways 
or processes include but are not limited to the 
modulation of cellular signal transduction pathways and 

20 their related second messenger molecules (e.g., 

intracellular Ca2 + , diacylglycerol , IP3 , cAMP etc.), 
cellular enzymatic activities, cellular responses (e.g. , 
cell survival, cellular differentiation, or cell 
proliferation) , or the induction or repression of 

25 cellular or heterologous mRNAs or proteins. CARD-3 or 
CARD-4 -dependent pathways or processes could be assayed 
by standard cell -based or cell free assays appropriate 
for the specific pathway or process under study. For 
example, Example 9 describes how expression of CARD-4S or 

30 CARD-4L in 293T cells induces the NF-kB pathway as 

determined by the measurement of a cotransf ected NF-/cB 
pathway lucif erase reporter gene. In another embodiment, 
cells cotransf ected with CARD-4 and the NF-kB luciferase 
reporter gene could be contacted with a test compound and 

35 test compounds that block CARD-4 activity could be 
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identified by their reduction of CARD- 4 -dependent NF-kB 
pathway lucif erase reporter gene expression. Test 
compounds that agonize CARD- 4 would be expected to 
increase reporter gene expression. In another 
5 embodiment, CARD- 4 could be expressed in a cell line and 
the recombinant CARD-4 -expressing cell line could be 
contacted with a test compound. Test compounds that 
inhibit CARD-4 activity could be indentified by their 
reduction of CARD-4 -depended NF-kB pathway stimulation as 

10 measured by the assay of a NF-kB pathway reporter gene, 
NF-kB nuclear localization, IkB phosphorylation or 
proteolysis, or other standard assays for NF-kB pathway 
activation known to those skilled in the art. 

In yet another embodiment, an assay of the present 

15 invention is a cell -free assay comprising contacting a 
CARD-3 or CARD-4 protein or biologically active portion 
thereof with a test compound and determining the ability 
of the test compound to bind to the CARD-3 or CARD-4 
protein or biologically active portion thereof. Binding 

20 of the test compound to the CARD-3 or CARD-4 protein can 
be determined either directly or indirectly as described 
above. In one embodiment, a competitive binding assay 
includes contacting the CARD-3 or CARD-4 protein or 
biologically active portion thereof with a compound known 

25 to bind CARD-3 or CARD-4 to form an assay mixture, 

contacting the assay mixture with a test compound, and 
determining the ability of the test compound to interact 
with a CARD-3 or CARD-4 protein, wherein determining the 
ability of the test compound to interact with a CARD-3 or 

30 CARD-4 protein comprises determining the ability of the 
test compound to preferentially bind to CARD-3 or CARD-4 
or biologically active portion thereof as compared to the 
known binding compound. 

In another embodiment, an assay is a cell -free 

35 assay comprising contacting CARD-3 or CARD-4 protein or 
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biologically active portion thereof with a test compound 
and determining the ability of the test compound to 
modulate (e.g., stimulate or inhibit) the activity of the 
CARD-3 or CARD-4 protein or biologically active portion 
5 thereof. Determining the ability of the test compound to 
modulate the activity of CARD-3 or CARD-4 can be 
accomplished, for example, by determining the ability of 
the CARD- 3 or CARD-4 protein to bind to a CARD-3 or 
CARD-4 target molecule by one of the methods described 

10 above for determining direct binding. In an alternative 
embodiment, determining the ability of the test compound 
to modulate the activity of CARD-3 or CARD-4 can be 
accomplished by determining the ability of the CARD-3 or 
CARD-4 protein to further modulate a CARD-3 or CARD-4 

15 target molecule. For example, the catalytic/enzymatic 
activity of the target molecule on an appropriate 
substrate can be determined as previously described. 

In yet another embodiment, the cell -free assay 
comprises contacting the CARD-3 or CARD-4 protein or 

20 biologically active portion thereof with a known compound 
which binds CARD-3 or CARD-4 to form an assay mixture, 
contacting the assay mixture with a test compound, and 
determining the ability of the test compound to interact 
with a CARD-3 or CARD-4 protein, wherein determining the 

25 ability of the test compound to interact with a CARD-3 or 
CARD-4 protein comprises determining the ability of the 
CARD-3 or CARD-4 protein to preferentially bind to or 
modulate the activity of a CARD-3 or CARD-4 target 
molecule. The cell -free assays of the present invention 

3 0 are amenable to use of both the soluble form or the 
membrane-associated form of CARD-3 or CARD-4. A 
membrane-associated form of CARD-3 or CARD-4 refers to 
CARD-3 or CARD-4 that interacts with a membrane -bound 
target molecule. In the case of cell-free assays 

3 5 comprising the membrane-associated form of CARD-3 or 
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CARD-4, it may be desirable to utilize a solubilizing 
agent such that the membrane-associated form of CARD-3 or 
CARD-4 is maintained in solution. Examples of such 
solubilizing agents include non-ionic detergents such as 
n-octylglucoside, n-dodecylglucoside , n-dodecylmaltoside , 
octanoyl -N-methylglucamide , decanoyl -N-methylglucamide , 
Triton® X-100, Triton® X-114, Thesit®, 
Isotridecypoly (ethylene glycol ether) n, 
3- [ (3 -cholamidopropyl ) dimethylamminio] -1 -propane 
sulfonate (CHAPS) , 

3- [ ( 3 -cholamidopropyl ) dimethylamminio] -2 -hydroxy- 1 -propan 
e sulfonate (CHAPSO) , or 

N-dodecyl=N, N- dimethyl -3 -ammonio-1 -propane sulfonate . 

In more than one embodiment of the above assay 
methods of the present invention, it may be desirable to 
immobilize either CARD-3 or CARD-4 or its target molecule 
to facilitate separation of complexed from uncomplexed 
forms of one or both of the proteins, as well as to 
accommodate automation of the assay. Binding of a test 
compound to CARD-3 or CARD-4, or interaction of CARD-3 or 
CARD-4 with a target molecule in the presence and absence 
of a candidate compound, can be accomplished in any 
vessel suitable for containing the reactants. Examples 
of such vessels include microtitre plates, test tubes, 
and micro-centrifuge tubes. In one embodiment, a fusion 
protein can be provided which adds a domain that allows 
one or both of the proteins to be bound to a matrix. For 
example, glutathione- S- transferase/ CARD-3 or CARD-4 
fusion proteins or glutathione-S- transf erase/target 
fusion proteins can be adsorbed onto glutathione 
sepharose beads (Sigma Chemical; St. Louis, MO) or 
glutathione derivatized microtitre plates, which are then 
combined with the test compound or the test compound and 
either the non-adsorbed target protein or CARD-3 or 
CARD-4 protein, and the mixture incubated under 
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conditions conducive to complex formation (e.g., at 
physiological conditions for salt and pH) . Following 
incubation, the beads or microtitre plate wells are 
washed to remove any unbound components, the matrix 
5 immobilized in the case of beads, complex determined 

either directly or indirectly, for example, as described 
above. Alternatively, the complexes can be dissociated 
from the matrix, and the level of CARD - 3 or CARD- 4 
binding or activity determined using standard techniques. 

10 In an alternative embodiment, MYC or HA epitope tag 

CARD-3 or CARD-4 fusion proteins or MYC or HA epitope tag 
target fusion proteins can be adsorbed onto ant i -MYC or 
ant i -HA antibody coated microbeads or onto ant i -MYC or 
anti-HA antibody coated microtitre plates, which are then 

15 combined with the test compound or the test compound and 
either the non-adsorbed target protein or CARD-3 or 
CARD-4 protein, and the mixture incubated under 
conditions conducive to complex formation (e.g., at 
physiological conditions for salt and pH) . Following 

20 incubation, the beads or microtitre plate wells are 
washed to remove any unbound components, the matrix 
immobilized in the case of beads, complex determined 
either directly or indirectly, for example, as described 
above. Alternatively, the complexes can be dissociated 

25 from the matrix, and the level of CARD-3 or CARD-4 

binding or activity determined using standard techniques. 
Example 12 describes an HA epitope tagged CARD-4 protein 
that physically interacts in a coimmunoprecipitat ion 
assay with MYC epitope tagged CARD-3. In an embodiment 

30 of the invention, HA epitope tagged CARD-4 could be used 
in combination with MYC epitope CARD-3 in the sort of 
protein-protein interaction assay described earlier in 
this paragraph . 

Other techniques for immobilizing proteins on 

35 matrices can also be used in the screening assays of the 
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invention. For example, either CARD- 3 or CARD-4 or its 
target molecule can be immobilized utilizing conjugation 
of biotin and streptavidin . Biotinylated CARD-3 or 
CARD-4 or target molecules can be prepared from 
5 biotin-NHS (N-hydroxy-succinimide) using techniques well 
known in the art (e.g., biotinylation kit, Pierce 
Chemicals; Rockford, IL) , and immobilized in the wells of 
streptavidin- coated 96 well plates (Pierce Chemical) . 
Alternatively, antibodies reactive with CARD-3 or CARD-4 

10 or target molecules but which do not interfere with 
binding of the CARD-3 or CARD-4 protein to its target 
molecule can be derivatized to the wells of the plate, 
and unbound target or CARD-3 or CARD-4 trapped in the 
wells by antibody conjugation. Methods for detecting 

15 such complexes, in addition to those described above for 
the GST- immobilized complexes and epitope tag immobilized 
complexes, include immunodetection of complexes using 
antibodies reactive with the CARD-3 or CARD-4 or target 
molecule, as well as enzyme-linked assays which rely on 

20 detecting an enzymatic activity associated with the 
CARD-3 or CARD-4 or target molecule. 

In another embodiment, modulators of CARD-3 or 
CARD-4 expression are identified in a method in which a 
cell is contacted with a candidate compound and the 

25 expression of the CARD-3 or CARD-4 promoter, mRNA or 
protein in the cell is determined. The level of 
expression of CARD-3 or CARD-4 mRNA or protein in the 
presence of the candidate compound is compared to the 
level of expression of CARD-3 or CARD-4 mRNA or protein 

3 0 in the absence of the candidate compound. The candidate 
compound can then be identified as a modulator of CARD-3 
or CARD-4 expression based on this comparison. For 
example, when expression of CARD-3 or CARD-4 mRNA or 
protein is greater (statistically significantly greater) 

3 5 in the presence of the candidate compound than in its 
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absence, the candidate compound is identified as a 
stimulator of CARD- 3 or CARD- 4 mRNA or protein 
expression. Alternatively, when expression of CARD-3 or 
CARD-4 mRNA or protein is less (statistically 
significantly less) in the presence of the candidate 
compound than in its absence, the candidate compound is 
identified as an inhibitor of CARD- 3 or CARD-4 mRNA or 
protein expression. The level of CARD-3 or CARD-4 mRNA 
or protein expression in the cells can be determined by 
methods described herein for detecting CARD-3 or CARD-4 
mRNA or protein. The activity of the CARD-3 or CARD-4 
promoter can be assayed by linking the CARD-3 or CARD-4 
promoter to a reporter gene such as lucif erase, secreted 
alkaline phosphatase, or beta-galactosidase and 
introducing the resulting construct into an appropriate 
vector, transfecting a host cell line, and measuring the 
activity of the reporter gene in response to test 
compounds. For example, two CARD-4 -specif ic mRNAs were 
detected in a Northern blotting experiment, one of 4.6 
kilobases and the other of 6.5-7.0 kilobases (Example 
11) . In Example 11, CARD-4 -specif ic mRNA species were 
found to be widely distributed in the tissues and cell 
lines studied. 

In yet another aspect of the invention, the CARD-3 
or CARD-4 proteins can be used as "bait proteins" in a 
two-hybrid assay or three hybrid assay (see, e.g., U.S. 
Patent No. 5,283,317; Zervos et al . (1993) Cell 
72:223-232; Madura et al . (1993) J. Biol. Chem. 
268:12046-12054; Bartel et al . (1993) Bio/Techniques 
14:920-924; Iwabuchi et al . (1993) Oncogene 8:1693-1696; 
and PCT Publication No. WO 94/10300) , to identify other 
proteins, which bind to or interact with CARD-3 or CARD-4 
("CARD-3 or CARD-4 -binding proteins" or "CARD-3 or 
CARD-4 -bp") and modulate CARD-3 or CARD-4 activity. Such 
CARD-3 or CARD-4 -binding proteins are also likely to be 
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involved in the propagation of signals by the CARD- 3 or 
CARD -4 proteins as, for example, upstream or downstream 
elements of the CARD -3 or CARD -4 pathway. For example, 
Example 7 describes the construction of a two-hybrid 
5 screening bait construct including human CARD-4L amino 
acids 1-14 5 comprising the CARD domain and the use of 
this bait construct to screen human mammary gland and 
prostate gland two-hybrid libraries resulting in the 
identification of human CARD-3 as a CARD-4 interacting 

10 protein- In another example, Example 8 describes the 
construction of a two-hybrid screening bait construct 
including human CARD-4 amino acids 4 06-953 comprising the 
LRR domain and the use of this bait construct to screen a 
human mammary gland two -hybrid libraries resulting in the 

15 identification of hNUDC as a CARD-4 interacting protein. 

The two-hybrid system is based on the modular 
nature of most transcription factors, which consist of 
separable DNA-binding and activation domains. Briefly, 
the assay utilizes two different DNA constructs. In one 

20 construct, the gene that codes for CARD-3 or CARD-4 is 
fused to a gene encoding the DNA binding domain of a 
known transcription factor (e.g., GAL- 4) . In the other 
construct, a DNA sequence, from a library of DNA 
sequences, that encodes an unidentified protein ("prey" 

25 or "sample") is fused to a gene that codes for the 

activation domain of the known transcription factor. If 
the "bait" and the "prey" proteins are able to interact, 
in vivo, forming an CARD-3 or CARD-4 -dependent complex, 
the DNA-binding and activation domains of the 

3 0 transcription factor are brought into close proximity. 
This proximity allows transcription of a reporter gene 
(e.g., LacZ) which is operably linked to a 
transcriptional regulatory site responsive to the 
transcription factor. Expression of the reporter gene 

35 can be detected and cell colonies containing the 
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functional transcription factor can be isolated and used 
to obtain the cloned gene which encodes the protein which 
interacts with CARD-3 or CARD-4 . 

In an embodiment of the invention, the ability of 
5 a test compound to modulate the activity of CARD-3, 

CARD-4, or a biologically active portion thereof can be 
determined by assaying the ability of the test compound 
to block the binding of CARD-3 and CARD-4 to their target 
proteins in a two-hybrid system assay. Example 7 

10 describes a two-hybrid system assay for the interaction 
between CARD-3 and CARD-4 and Example 8 describes a 
two-hybrid system assay for the interaction between 
CARD-4 and its target protein hNUDC. To screen for test 
compounds that block the interaction between CARD-3 and 

15 CARD-4 and their target proteins, which include but are 
not limited to CARD-3, CARD-4, and hNUDC, a yeast 
two-hybrid screening strain coexpressing the interacting 
bait and prey constructs, for example, a CARD-4 bait 
construct and a CARD-3 prey construct as described in 

20 Example 7 , is contacted with the test compound and the 
activity of the two-hybrid system reporter gene, usually 
HIS3, lacZ, or URA3 is assayed. If the strain remains 
viable but exhibits a significant decrease in reporter 
gene activity, this would indicate that the test compound 

25 has inhibited the interaction between the bait and prey 
proteins. This assay could be automated for high 
throughput drug screening purposes. In another 
embodiment of the invention, CARD-3 or CARD-4 and their 
target proteins could be configured in the reverse 

30 two-hybrid system (Vidal et al . (1996) Proc . Natl. Acad. 
Sci. USA 93:10321-6 and Vidal et al . (1996) Proc. Natl. 
Acad. Sci. USA 93:10315-20) designed specifically for 
efficient drug screening. In the reverse two-hybrid 
system, inhibition of a CARD-3 or CARD-4 physical 

35 interaction with a target protein would result in 
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induction of a reporter gene in contrast to the normal 
two-hybrid system where inhibition of CARD-3 or CARD-4 
physical interaction with a target protein would lead to 
reporter gene repression. The reverse two-hybrid system 
5 is preferred for drug screening because reporter gene 
induction is more easily assayed than reporter gene 
repression. 

Alternative embodiments of the invention are 
proteins found to physically interact with proteins that 

10 bind to CARD-3 or CARD-4. CARD-3 or CARD-4 interactors, 
including but not limited to hNUDC and CARD-3, could be 
configured into two-hybrid system baits and used in 
two-hybrid screens to identify additional members of the 
CARD-3 and CARD-4 pathway. The interactors of CARD-3 or 

15 CARD-4 interactors identified in this way could be useful 
targets for therapeutic intervention in CARD-4 related 
diseases and pathologies and an assay of their enzymatic 
or binding activity could be useful for the 
identification of test compounds that modulate CARD-3 or 

20 CARD-4 activity. 

This invention further pertains to novel agents 
identified by the above -described screening assays and 
uses thereof for treatments as described herein. 

B. Detection Assays 

25 Portions or fragments of the cDNA sequences 

identified herein (and the corresponding complete gene 
sequences) can be used in numerous ways as polynucleotide 
reagents. For example, these sequences can be used to: 
(i) map their respective genes on a chromosome; and, 

30 thus, locate gene regions associated with genetic 
disease; (ii) identify an individual from a minute 
biological sample (tissue typing) ; and (iii) aid in 
forensic identification of a biological sample. These 
applications are described in the subsections below. 
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1. Chromosome Mapping 

Once the sequence (or a portion of the sequence) 
of a gene has been isolated, this sequence can be used to 
map the location of the gene on a chromosome. 
5 Accordingly, CARD-3 or CARD-4 nucleic acid molecules 

described herein or fragments thereof, can be used to map 
the location of CARD-3 or CARD-4 genes on a chromosome. 
The mapping of the CARD-3 or CARD-4 sequences to 
chromosomes is an important first step in correlating 

10 these sequences with genes associated with disease. 

Briefly, CARD-3 or CARD-4 genes can be mapped to 
chromosomes by preparing PCR primers (preferably 15-25 bp 
in length) from the CARD-3 or CARD-4 sequences. Computer 
analysis of CARD-3 or CARD-4 sequences can be used to 

15 rapidly select primers that do not span more than one 
exon in the genomic DNA, thus complicating the 
amplification process. These primers can then be used 
for PCR screening of somatic cell hybrids containing 
individual human chromosomes . Only those hybrids 

2 0 containing the human gene corresponding to the CARD-3 or 
CARD-4 sequences will yield an amplified fragment. For 
example, in Example 6, human CARD-4 -specif ic PCR primers 
were used to screen DNAs from a somatic cell hybrid panel 
showing that human CARD-4 maps to chromosome 7 close to 

25 the SHGC-31928 genetic marker. 

Somatic cell hybrids are prepared by fusing 
somatic cells from different mammals (e.g., human and 
mouse cells) . As hybrids of human and mouse cells grow 
and divide, they gradually lose human chromosomes in 

30 random order, but retain the mouse chromosomes. By using 
media in which mouse cells cannot grow, because they lack 
a particular enzyme, but human cells can, the one human 
chromosome that contains the gene encoding the needed 
enzyme, will be retained. By using various media, panels 

35 of hybrid cell lines can be established. Each cell line 
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in a panel contains either a single human chromosome or a 
small number of human chromosomes, and a full set of 
mouse chromosomes, allowing easy mapping of individual 
genes to specific human chromosomes. (D'Eustachio et al . 
5 (1983) Science 220:919-924). Somatic cell hybrids 

containing only fragments of human chromosomes can also 
be produced by using human chromosomes with 
translocations and deletions. 

PCR mapping of somatic cell hybrids is a rapid 

10 procedure for assigning a particular sequence to a 

particular chromosome. Three or more sequences can be 
assigned per day using a single thermal cycler. Using 
the CARD-3 or CARD-4 sequences to design oligonucleotide 
primers, sublocalization can be achieved with panels of 

15 fragments from specific chromosomes. Other mapping 

strategies which can similarly be used to map a CARD-3 or 
CARD-4 sequence to its chromosome include in situ 
hybridization (described in Fan et al . (1990) Proc . Natl. 
Acad. Sci. USA 87:6223-27), pre-screening with labeled 

2 0 flow-sorted chromosomes, and pre-selection by 

hybridization to chromosome specific cDNA libraries. 

Fluorescence in situ hybridization (FISH) of a DNA 
sequence to a metaphase chromosomal spread can further be 
used to provide a precise chromosomal location in one 
25 step. Chromosome spreads can be made using cells whose 
division has been blocked in metaphase by a chemical like 
colcemid that disrupts the mitotic spindle. The 
chromosomes can be treated briefly with trypsin, and then 
stained with Giemsa. A pattern of light and dark bands 

3 0 develops on each chromosome, so that the chromosomes can 

be identified individually. The FISH technique can be 
used with a DNA sequence as short as 50 0 or 600 bases. 
However, clones larger than 1,000 bases have a higher 
likelihood of binding to a unique chromosomal location 
35 with sufficient signal intensity for simple detection. 
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Preferably 1,000 bases, and more preferably 2,000 bases 
will suffice to get good results at a reasonable amount 
of time. For a review of this technique, see Verma et 
al . , (Human Chromosomes: A Manual of Basic Techniques 
5 (Pergamon Press, New York, 1988)). 

Reagents for chromosome mapping can be used 
individually to mark a single chromosome or a single site 
on that chromosome, or panels of reagents can be used for 
marking multiple sites and/or multiple chromosomes. 

10 Reagents corresponding to noncoding regions of the genes 
actually are preferred for mapping purposes. Coding 
sequences are more likely to be conserved within gene 
families, thus increasing the chance of cross 
hybridizations during chromosomal mapping. 

15 Once a sequence has been mapped to a precise 

chromosomal location , the physical position of the 
sequence on the chromosome can be correlated with genetic 
map data. (Such data are found, for example, in V. 
McKusick, Mendelian Inheritance in Man, available on-line 

2 0 through Johns Hopkins University Welch Medical Library) . 
The relationship between genes and disease, mapped to the 
same chromosomal region, can then be identified through 
linkage analysis (co-inheritance of physically adjacent 
genes), described in, e.g., Egeland et al . (1987) Nature, 

25 325:783-787. 

Moreover, differences in the DNA sequences between 
individuals affected and unaffected with a disease 
associated with the CARD-3 or CARD-4 gene can be 
determined. If a mutation is observed in some or all of 

30 the affected individuals but not in any unaffected 
individuals, then the mutation is likely to be the 
causative agent of the particular disease. Comparison of 
affected and unaffected individuals generally involves 
first looking for structural alterations in the 

35 chromosomes such as deletions or translocations that are 
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visible from chromosome spreads or detectable using PCR 
based on that DNA sequence. Ultimately, complete 
sequencing of genes from several individuals can be 
performed to confirm the presence of a mutation and to 
5 distinguish mutations from polymorphisms. 
2. Tissue Typing 

The CARD- 3 or CARD- 4 sequences of the present 
invention can also be used to identify individuals from 
minute biological samples. The United States military, 

10 for example, is considering the use of restriction 

fragment length polymorphism (RFLP) for identification of 
its personnel. In this technique, an individual's 
genomic DNA is digested with one or more restriction 
enzymes, and probed on a Southern blot to yield unique 

15 bands for identification. This method does not suffer 
from the current limitations of "Dog Tags" which can be 
lost, switched, or stolen, making positive identification 
difficult. The sequences of the present invention are 
useful as additional DNA markers for RFLP (described in 

20 U.S. Patent 5,272,057). 

Furthermore, the sequences of the present 
invention can be used to provide an alternative technique 
which determines the actual base-by-base DNA sequence of 
selected portions of an individual's genome. Thus, the 

2 5 CARD- 3 or CARD-4 sequences described herein can be used 

to prepare two PCR primers from the 5' and 3' ends of the 
sequences. These primers can then be used to amplify an 
individual's DNA and subsequently sequence it. 

Panels of corresponding DNA sequences from 

3 0 individuals, prepared in this manner, can provide unique 

individual identifications, as each individual will have 
a unique set of such DNA sequences due to allelic 
differences. The sequences of the present invention can 
be used to obtain such identification sequences from 
35 individuals and from tissue. The CARD -3 or CARD-4 
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sequences of the invention uniquely represent portions of 
the human genome. Allelic variation occurs to some 
degree in the coding regions of these sequences, and to a 
greater degree in the noncoding regions. It is estimated 
5 that allelic variation between individual humans occurs 
with a frequency of about once per each 500 bases. Each 
of the sequences described herein can, to some degree, be 
used as a standard against which DNA from an individual 
can be compared for identification purposes. Because 

10 greater numbers of polymorphisms occur in the noncoding 
regions, fewer sequences are necessary to differentiate 
individuals. The noncoding sequences of SEQ ID N0:1, SEQ 
ID NO: 7, SEQ ID NO: 25 and SEQ ID NO: 42 can comfortably 
provide positive individual identification with a panel 

15 of perhaps 10 to 1,000 primers which each yield a 

noncoding amplified sequence of 100 bases. If predicted 
coding sequences, such as those in SEQ ID NO: 3, SEQ ID 
NO: 9, and SEQ ID NO: 27 are used, a more appropriate 
number of primers for positive individual identification 

2 0 would be 500-2,000. 

If a panel of reagents from CARD- 3 or CARD-4 
sequences described herein is used to generate a unique 
identification database for an individual, those same 
reagents can later be used to identify tissue from that 

25 individual. Using the unique identification database, 
positive identification of the individual, living or 
dead, can be made from extremely small tissue samples. 

3, Use of Partial CARD- 3 or CARD-4 Sequences in 
Forensic Biology 

30 DNA-based identification techniques can also be 

used in forensic biology. Forensic biology is a 
scientific field employing genetic typing of biological 
evidence found at a crime scene as a means for positively 
identifying, for example, a perpetrator of a crime. To 

35 make such an identification, PCR technology can be used 
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to amplify DNA sequences taken from very small biological 
samples such as tissues, e.g., hair or skin, or body 
fluids, e.g., blood, saliva, or semen found at a crime 
scene. The amplified sequence can then be compared to a 
5 standard, thereby allowing identification of the origin 
of the biological sample. 

The sequences of the present invention can be used 
to provide polynucleotide reagents, e.g., PCR primers, 
targeted to specific loci in the human genome, which can 

10 enhance the reliability of DNA-based forensic 

identifications by, for example, providing another 
"identification marker" (i.e. another DNA sequence that 
is unique to a particular individual) . As mentioned 
above, actual base sequence information can be used for 

15 identification as an accurate alternative to patterns 
formed by restriction enzyme generated fragments. 
Sequences targeted to noncoding regions of SEQ ID NO:l, 
SEQ ID NO: 7, and SEQ ID NO: 25 are particularly 
appropriate for this use as greater numbers of 

20 polymorphisms occur in the noncoding regions, making it 
easier to differentiate individuals using this technique. 
Examples of polynucleotide reagents include the CARD- 3 or 
CARD-4 sequences or portions thereof, e.g., fragments 
derived from the noncoding regions of SEQ ID NO:l, SEQ ID 

2 5 NO: 7, or SEQ ID NO: 25 which have a length of at least 2 0 
or 30 bases. 

The CARD-3 or CARD-4 sequences described herein 
can further be used to provide polynucleotide reagents, 
e.g., labeled or labelable probes which can be used in, 

30 for example, an in situ hybridization technique, to 

identify a specific tissue, e.g., brain tissue. This can 
be very useful in cases where a forensic pathologist is 
presented with a tissue of unknown origin. Panels of 
such CARD-3 or CARD-4 probes can be used to identify 

35 tissue by species and/or by organ type. 
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In a similar fashion, these reagents, e.g., CARD-3 
or CARD-4 primers or probes can be used to screen tissue 
culture for contamination (i.e., screen for the presence 
of a mixture of different types of cells in a culture) . 

5 C. Predictive Medicine 

The present invention also pertains to the field 
of predictive medicine in which diagnostic assays, 
prognostic assays, pharmacogenomics , and monitoring 
clinical trials are used for prognostic (predictive) 

10 purposes to thereby treat an individual prophylactically . 
Accordingly, one aspect of the present invention relates 
to diagnostic assays for determining CARD-3 or CARD-4 
protein and/or nucleic acid expression as well as CARD-3 
or CARD-4 activity, in the context of a biological sample 

15 (e.g., blood, serum, cells, tissue) to thereby determine 
whether an individual is afflicted with a disease or 
disorder, or is at risk of developing a disorder, 
associated with aberrant CARD-3 or CARD-4 expression or 
activity. The invention also provides for prognostic (or 

20 predictive) assays for determining whether an individual 
is at risk of developing a disorder associated with 
CARD-3 or CARD-4 protein, nucleic acid expression or 
activity. For example, mutations in a CARD-3 or CARD-4 
gene can be assayed in a biological sample. Such assays 

25 can be used for prognostic or predictive purpose to 

thereby prophylactically treat an individual prior to the 
onset of a disorder characterized by or associated with 
CARD-3 or CARD-4 protein, nucleic acid expression or 
activity. 

3 0 Another aspect of the invention provides methods 

for determining CARD-3 or CARD-4 protein, nucleic acid 
expression or CARD-3 or CARD-4 activity in an individual 
to thereby select appropriate therapeutic or prophylactic 
agents for that individual (referred to herein as 
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selection of agents (e.g., drugs) for therapeutic or 
prophylactic treatment of an individual based on the 
genotype of the individual (e.g., the genotype of the 
5 individual examined to determine the ability of the 
individual to respond to a particular agent.) 

Yet another aspect of the invention pertains to 
monitoring the influence of agents (e.g., drugs or other 
compounds) on the expression or activity of CARD- 3 or 
10 CARD-4 in clinical trials. 

These and other agents are described in further 
detail in the following sections. 

1 o Diagnostic Assays 

An exemplary method for detecting the presence or 

15 absence of CARD-3 or CARD-4 in a biological sample 
involves obtaining a biological sample from a test 
subject and contacting the biological sample with a 
compound or an agent capable of detecting CARD-3 or 
CARD-4 protein or nucleic acid (e.g., mRNA, genomic DNA) 

20 that encodes CARD-3 or CARD-4 protein such that the 
presence of CARD-3 or CARD-4 is detected in the 
biological sample. An agent for detecting CARD-3 or 
CARD-4 mRNA or genomic DNA is a labeled nucleic acid 
probe capable of hybridizing to CARD-3 or CARD-4 mRNA or 

25 genomic DNA. The nucleic acid probe can be, for example, 
a full-length CARD-3 or CARD-4 nucleic acid, such as the 
nucleic acid of SEQ ID NO: 1 or 3 , SEQ ID NO: 7 or 9, SEQ 
ID NO:25 or 27, or a portion thereof, such as an 
oligonucleotide of at least 15, 30, 50, 100, 250 or 500 

30 nucleotides in length and sufficient to specifically 

hybridize under stringent conditions to CARD-3 or CARD-4 
mRNA or genomic DNA, or a human CARD-4 splice variant 
such as the nucleic acid of SEQ ID NO: 38 or SEQ ID NO: 40. 
Other suitable probes for use in the diagnostic assays of 

3 5 the invention are described herein. For example, Example 
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11 describes the use of a nucleic acid probe to detect 
CARD-4 mRNAs in human tissues and cell lines and the 
probe used in this experiment could be used for a 
diagnostic assay. 

An agent for detecting CARD- 3 or CARD-4 protein 
can be an antibody capable of binding to CARD- 3 or CARD-4 
protein, preferably an antibody with a detectable label. 
Antibodies can be polyclonal, or more preferably, 
monoclonal. For example, polypeptides corresponding to 
amino acids 128-139 and 287-298 of human CARD - 4 L were 
used to immunize rabbits and produce polyclonal 
antibodies that specifically recognize human CARD-4L. An 
intact antibody, or a fragment thereof (e.g., Fab or 
F(ab')2) can be used. The term "labeled", with regard to 
the probe or antibody, is intended to encompass direct 
labeling of the probe or antibody by coupling (i.e., 
physically linking) a detectable substance to the probe 
or antibody, as well as indirect labeling of the probe or 
antibody by reactivity with another reagent that is 
directly labeled. Examples of indirect labeling include 
detection of a primary antibody using a f luorescently 
labeled secondary antibody and end-labeling of a DNA 
probe with biotin such that it can be detected with 
f luorescently labeled streptavidin . The term "biological 
sample" is intended to include tissues, cells and 
biological fluids isolated from a subject, as well as 
tissues, cells and fluids present within a subject. That 
is, the detection method of the invention can be used to 
detect CARD-3 or CARD-4 mRNA, protein, or genomic DNA in 
a biological sample in vitro as well as in vivo. For 
example, in vitro techniques for detection of CARD-3 or 
CARD-4 mRNA include Northern hybridizations and in situ 
hybridizations. For example, Example 11 contains the use 
of a human CARD - 4 L nucleic acid probe for a Northern 
blotting analysis of mRNA species encoded by human 
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CARD-4L detected in RNA samples from human tissues and 
cell lines. In vitro techniques for detection of CARD-3 
or CARD -4 protein include enzyme linked immunosorbent 
assays (ELISAs) , Western blots, immunoprecipitat ions and 
5 immunofluorescence. In vitro techniques for detection of 
CARD -3 or CARD -4 genomic DNA include Southern 
hybridizations. Furthermore, in vivo techniques for 
detection of CARD-3 or CARD-4 protein include introducing 
into a subject a labeled anti-CARD-3 or CARD-4 antibody. 

10 For example, the antibody can be labeled with a 

radioactive marker whose presence and location in a 
subject can be detected by standard imaging techniques. 

In one embodiment, the biological sample contains 
protein molecules from the test subject. Alternatively, 

15 the biological sample can contain mRNA molecules from the 
test subject or genomic DNA molecules from the test 
subject. An biological sample is a peripheral blood 
leukocyte sample isolated by conventional means from a 
subject. 

20 In another embodiment, the methods further involve 

obtaining a control biological sample from a control 
subject, contacting the control sample with a compound or 
agent capable of detecting CARD-3 or CARD-4 protein, 
mRNA, or genomic DNA, such that the presence of CARD-3 or 

25 CARD-4 protein, mRNA or genomic DNA is detected in the 
biological sample, and comparing the presence of CARD-3 
or CARD-4 protein, mRNA or genomic DNA in the control 
sample with the presence of CARD-3 or CARD-4 protein, 
mRNA or genomic DNA in the test sample. 

3 0 The invention also encompasses kits for detecting 

the presence of CARD-3 or CARD-4 in a biological sample 
(a test sample) . Such kits can be used to determine if a 
subject is suffering from or is at increased risk of 
developing a disorder associated with aberrant expression 

35 of CARD-3 or CARD-4 (e.g., an immunological disorder). 
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For example, the kit can comprise a labeled compound or 
agent capable of detecting CARD-3 or CARD-4 protein or 
mRNA in a biological sample and means for determining the 
amount of CARD-3 or CARD-4 in the sample (e.g., an 
5 anti-CARD-3 or CARD-4 antibody or an oligonucleotide 

probe which binds to DNA encoding CARD-3 or CARD-4, e.g., 
SEQ ID NO:l, SEQ ID NO : 3 , SEQ ID NO : 7 , SEQ ID NO : 9 , SEQ 
ID NO: 25 or SEQ ID NO:27) . Kits may also include 
instruction for observing that the tested subject is 

10 suffering from or is at risk of developing a disorder 
associated with aberrant expression of CARD-3 or CARD-4 
if the amount of CARD-3 or CARD-4 protein or mRNA is 
above or below a normal level . 

For antibody-based kits, the kit may comprise, for 

15 example: (1) a first antibody (e.g., attached to a solid 
support) which binds to CARD-3 or CARD-4 protein; and, 
optionally, (2) a second, different antibody which binds 
to CARD-3 or CARD-4 protein or the first antibody and is 
conjugated to a detectable agent. 

20 For oligonucleotide-based kits, the kit may 

comprise, for example: (1) a oligonucleotide, e.g., a 
detectably labelled oligonucleotide, which hybridizes to 
a CARD-3 or CARD-4 nucleic acid sequence or (2) a pair of 
primers useful for amplifying a CARD-3 or CARD-4 nucleic 

25 acid molecule. 

The kit may also comprise, e.g., a buffering 
agent, a preservative, or a protein stabilizing agent. 
The kit may also comprise components necessary for 
detecting the detectable agent (e.g., an enzyme or a 

30 substrate) . The kit may also contain a control sample or 
a series of control samples which can be assayed and 
compared to the test sample contained. Each component of 
the kit is usually enclosed within an individual 
container and all of the various containers are within a 

3 5 single package along with instructions for observing 
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whether the tested subject is suffering from or is at 
risk of developing a disorder associated with aberrant 
expression of CARD- 3 or CARD -4 . 

2 . Prognostic Assays 
5 The methods described herein can furthermore be 

utilized as diagnostic or prognostic assays to identify 
subjects having or at risk of developing a disease or 
disorder associated with aberrant CARD-3 or CARD-4 
expression or activity. For example, the assays 

10 described herein, such as the preceding diagnostic assays 
or the following assays, can be utilized to identify a 
subject having or at risk of developing a disorder 
associated with CARD-3 or CARD-4 protein, nucleic acid 
expression or activity. Alternatively, the prognostic 

15 assays can be utilized to identify a subject having or at 
risk for developing such a disease or disorder. Thus, 
the present invention provides a method in which a test 
sample is obtained from a subject and CARD-3 or CARD-4 
protein or nucleic acid (e.g., mRNA, genomic DNA) is 

20 detected, wherein the presence of CARD-3 or CARD-4 
protein or nucleic acid is diagnostic for a subject 
having or at risk of developing a disease or disorder 
associated with aberrant CARD-3 or CARD-4 expression or 
activity. As used herein, a "test sample" refers to a 

25 biological sample obtained from a subject of interest. 
For example, a test sample can be a biological fluid 
(e.g., serum), cell sample, or tissue. Furthermore, 
the prognostic assays described herein can be used to 
determine whether a subject can be administered an agent 

30 (e.g., an agonist, antagonist, peptidomimetic, protein, 
peptide, nucleic acid, small molecule, or other drug 
candidate) to treat a disease or disorder associated with 
aberrant CARD-3 or CARD-4 expression or activity. For 
example, such methods can be used to determine whether a 

35 subject can be effectively treated with a specific agent 
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or class of agents (e.g., agents of a type which decrease 
CARD-3 or CARD-4 activity) . Thus, the present invention 
provides methods for determining whether a subject can be 
effectively treated with an agent for a disorder 
5 associated with aberrant CARD-3 or CARD-4 expression or 
activity in which a test sample is obtained and CARD-3 or 
CARD-4 protein or nucleic acid is detected (e.g., wherein 
the presence of CARD-3 or CARD-4 protein or nucleic acid 
is diagnostic for a subject that can be administered the 

10 agent to treat a disorder associated with aberrant CARD-3 
or CARD-4 expression or activity) . 

The methods of the invention can also be used to 
detect genetic lesions or mutations in a CARD-3 or CARD-4 
gene, thereby determining if a subject with the lesioned 

15 gene is at risk for a disorder characterized by aberrant 
cell proliferation and/or differentiation. In preferred 
embodiments, the methods include detecting, in a sample 
of cells from the subject, the presence or absence of a 
genetic lesion characterized by at least one of an 

20 alteration affecting the integrity of a gene encoding a 
CARD-3 or CARD-4 -protein, or the mis-expression of the 
CARD-3 or CARD-4 gene. For example, such genetic lesions 
can be detected by ascertaining the existence of at least 
one of 1) a deletion of one or more nucleotides from a 

25 CARD-3 or CARD-4 gene; 2) an addition of one or more 

nucleotides to a CARD-3 or CARD-4 gene; 3) a substitution 
of one or more nucleotides of a CARD-3 or CARD-4 gene, 4) 
a chromosomal rearrangement of a CARD-3 or CARD-4 gene; 
5) an alteration in the level of a messenger RNA 

3 0 transcript of a CARD-3 or CARD-4 gene, 6) aberrant 

modification of a CARD-3 or CARD-4 gene, such as of the 
methylation pattern of the genomic DNA, 7) the presence 
of a non-wild type splicing pattern of a messenger RNA 
transcript of a CARD-3 or CARD-4 gene (e.g, caused by a 

35 mutation in a splice donor or splice acceptor site), 8) a 
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non-wild type level of a CARD -3 or CARD-4 -protein , 9) 
allelic loss of a CARD-3 or CARD-4 gene, and 10) 
inappropriate post- translat ional modification of a CARD-3 
or CARD-4 -protein. As described herein, there are a 
5 large number of assay techniques known in the art which 
can be used for detecting lesions in a CARD-3 or CARD-4 
gene. A biological sample is a peripheral blood 
leukocyte sample isolated by conventional means from a 
subj ect . 

10 In certain embodiments, detect ion of the lesion 

involves the use of a probe/primer in a polymerase chain 
reaction (PCR) (see, e.g., U.S. Patent Nos . 4,683,195 and 
4,683,202), such as anchor PCR or RACE PCR, or, 
alternatively, in a ligation chain reaction (LCR) (see, 

15 e.g., Landegran et al . (1988) Science 241:1077-1080; and 
Nakazawa et al . (1994) Proc . Natl. Acad. Sci . USA 
91:360-364), the latter of which can be particularly 
useful for detecting point mutations in the CARD-3 or 
CARD-4-gene (see, e.g., Abravaya et al . (1995) Nucleic 

20 Acids Res. 23:675-682). This method can include the 
steps of collecting a sample of cells from a patient, 
isolating nucleic acid (e.g., genomic, mRNA or both) from 
the cells of the sample, contacting the nucleic acid 
sample with one or more primers which specifically 

25 hybridize to a CARD-3 or CARD-4 gene under conditions 
such that hybridization and amplification of the CARD-3 
or CARD-4 -gene (if present) occurs, and detecting the 
presence or absence of an amplification product, or 
detecting the size of the amplification product and 

3 0 comparing the length to a control sample. It is 

anticipated that PCR and/or LCR may be desirable to use 
as a preliminary amplification step in conjunction with 
any of the techniques used for detecting mutations 
described herein. 
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Alternative amplification methods include: self 
sustained sequence replication (Guatelli et al . (1990) 
Proc. Natl. Acad. Sci . USA 87:1874-1878), transcriptional 
amplification system (Kwoh, et al . (1989) Proc. Natl. 
5 Acad. Sci. USA 86:1173-1177), Q-Beta Replicase (Lizardi 
et al . (1988) Bio/Technology 6:1197), or any other 
nucleic acid amplification method, followed by the 
detection of the amplified molecules using techniques 
well known to those of skill in the art. These detection 

10 schemes are especially useful for the detection of 

nucleic acid molecules if such molecules are present in 
very low numbers . 

In an alternative embodiment, mutations in a 
CARD-3 or CARD-4 gene from a sample cell can be 

15 identified by alterations in restriction enzyme cleavage 
patterns. For example, sample and control DNA is 
isolated, amplified (optionally) , digested with one or 
more restriction endonucleases , and fragment length sizes 
are determined by gel electrophoresis and compared. 

2 0 Differences in fragment length sizes between sample and 
control DNA indicates mutations in the sample DNA. 
Moreover, the use of sequence specific ribozymes (see, 
e.g., U.S. Patent No. 5,498,531) can be used to score for 
the presence of specific mutations by development or loss 

25 of a ribozyme cleavage site. 

In other embodiments, genetic mutations in CARD-3 
or CARD- 4 can be identified by hybridizing a sample and 
control nucleic acids, e.g., DNA or RNA, to high density 
arrays containing hundreds or thousands of 

30 oligonucleotides probes (Cronin et al . (1996) Human 

Mutation 7:244-255; Kozal et al . (1996) Nature Medicine 
2:753-759). For example, genetic mutations in CARD-3 or 
CARD-4 can be identified in two-dimensional arrays 
containing light -generated DNA probes as described in 

35 Cronin et al . supra. Briefly, a first hybridization 
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array of probes can be used to scan through long 
stretches of DNA in a sample and control to identify base 
changes between the sequences by making linear arrays of 
sequential overlapping probes. This step allows the 
5 identification of point mutations. This step is followed 
by a second hybridization array that allows the 
characterization of specific mutations by using smaller, 
specialized probe arrays complementary to all variants or 
mutations detected. Each mutation array is composed of 

10 parallel probe sets, one complementary to the wild- type 
gene and the other complementary to the mutant gene. 

In yet another embodiment, any of a variety of 
sequencing reactions known in the art can be used to 
directly sequence the CARD- 3 or CARD- 4 gene and detect 

15 mutations by comparing the sequence of the sample CARD- 3 
or CARD-4 with the corresponding wild- type (control) 
sequence. Examples of sequencing reactions include those 
based on techniques developed by Maxam and Gilbert 
((1977) Proc. Natl. Acad. Sci . USA 74:560) or Sanger 

20 ((1977) Proc. Natl. Acad. Sci. USA 74:5463). It is also 
contemplated that any of a variety of automated 
sequencing procedures can be utilized when performing the 
diagnostic assays ((1995) Bio/Techniques 19:448), 
including sequencing by mass spectrometry (see, e.g., PCT 

25 Publication No. WO 94/16101; Cohen et al . (1996) Adv. 
Chromatogr. 36:127-162; and Griffin et al . (1993) Appl . 
Biochem. Biotechnol . 38:147-159). 

Other methods for detecting mutations in the 
CARD-3 or CARD-4 gene include methods in which protection 

3 0 from cleavage agents is used to detect mismatched bases 
in RNA/RNA or RNA/DNA heteroduplexes (Myers et al . (1985) 
Science 230:1242) • In general, the art technique of 
"mismatch cleavage" starts by providing heteroduplexes of 
formed by hybridizing (labeled) RNA or DNA containing the 

35 wild-type CARD-3 or CARD-4 sequence with potentially 
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mutant RNA or DNA obtained from a tissue sample. The 
double -stranded duplexes are treated with an agent which 
cleaves single-stranded regions of the duplex such as 
which will exist due to basepair mismatches between the 
5 control and sample strands. For instance, RNA /DNA 

duplexes can be treated with RNase and DNA /DNA hybrids 
treated with SI nuclease to enzymatically digesting the 
mismatched regions. In other embodiments, either DNA / DNA 
or RNA / DNA duplexes can be treated with hydroxylamine or 

10 osmium tetroxide and with piperidine in order to digest 
mismatched regions. After digestion of the mismatched 
regions, the resulting material is then separated by size 
on denaturing polyacrylamide gels to determine the site 
of mutation. See, e.g., Cotton et al (1988) Proc . Natl 

15 Acad Sci USA 85:4397; Saleeba et al (1992) Methods 

Enzymol . 217:286-295. In an embodiment, the control DNA 
or RNA can be labeled for detection. 

In still another embodiment, the mismatch cleavage 
reaction employs one or more proteins that recognize 

20 mismatched base pairs in double - stranded DNA (so called 
"DNA mismatch repair" enzymes) in defined systems for 
detecting and mapping point mutations in CARD-3 or CARD-4 
cDNAs obtained from samples of cells. For example, the 
mutY enzyme of E. coli cleaves A at G/A mismatches and 

25 the thymidine DNA glycosylase from HeLa cells cleaves T 
at G/T mismatches (Hsu et al . (1994) Carcinogenesis 
15:1657-1662). According to an exemplary embodiment, a 
probe based on a CARD-3 or CARD-4 sequence, e.g., a 
wild- type CARD-3 or CARD-4 sequence, is hybridized to a 

30 cDNA or other DNA product from a test cell (s) . The 

duplex is treated with a DNA mismatch repair enzyme, and 
the cleavage products, if any, can be detected from 
electrophoresis protocols or the like. See, e.g., U.S. 
Patent No. 5, 459, 039. 
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In other embodiments, alterations in 
electrophoretic mobility will be used to identify 
mutations in CARD-3 or CARD-4 genes. For example, single 
strand conformation polymorphism (SSCP) may be used to 
5 detect differences in electrophoretic mobility between 
mutant and wild type nucleic acids (Orita et al . (1989) 
Proc Natl. Acad. Sci USA: 86:2166, see also Cotton (1993) 
Mutat . Res. 285:125-144; and Hayashi (1992) Genet Anal 
Tech Appl 9:73-79). Single-stranded DNA fragments of 

10 sample and control CARD-3 or CARD-4 nucleic acids will be 
denatured and allowed to renature . The secondary 
structure of single-stranded nucleic acids varies 
according to sequence, the resulting alteration in 
electrophoretic mobility enables the detection of even a 

15 single base change. The DNA fragments may be labeled or 
detected with labeled probes. The sensitivity of the 
assay may be enhanced by using RNA (rather than DNA) , in 
which the secondary structure is more sensitive to a 
change in sequence. In an embodiment, the subject method 

2 0 utilizes heteroduplex analysis to separate double 

stranded heteroduplex molecules on the basis of changes 
in electrophoretic mobility (Keen et al . (1991) Trends 
Genet 7:5). 

In yet another embodiment, the movement of mutant 

2 5 or wild- type fragments in polyacrylamide gels containing 

a gradient of denaturant is assayed using denaturing 
gradient gel electrophoresis (DGGE) (Myers et al . (1985) 
Nature 313:495). When DGGE is used as the method of 
analysis, DNA will be modified to insure that it does not 

3 0 completely denature, for example by adding a GC clamp of 

approximately 40 bp of high-melting GC-rich DNA by PCR. 
In a further embodiment, a temperature gradient is used 
in place of a denaturing gradient to identify differences 
in the mobility of control and sample DNA (Rosenbaum and 
35 Reissner (1987) Biophys Chem 265:12753). 
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Examples of other techniques for detecting point 
mutations include, but are not limited to, selective 
oligonucleotide hybridization, selective amplification, 
or selective primer extension. For example, 
5 oligonucleotide primers may be prepared in which the 

known mutation is placed centrally and then hybridized to 
target DNA under conditions which permit hybridization 
only if a perfect match is found (Saiki et al . (1986) 
Nature 324:163); Saiki et al . (1989) Proc . Natl Acad. Sci 
10 USA 86:6230). Such allele specific oligonucleotides are 
hybridized to PGR amplified target DNA or a number of 
different mutations when the oligonucleotides are 
attached to the hybridizing membrane and hybridized with 
labeled target DNA. 
15 Alternatively, allele specific amplification 

technology which depends on selective PCR amplification 
may be used in conjunction with the instant invention. 
Oligonucleotides used as primers for specific 
amplification may carry the mutation of interest in the 
20 center of the molecule (so that amplification depends on 
differential hybridization) (Gibbs et al . (1989) Nucleic 
Acids Res. 17:2437-2448) or at the extreme 3' end of one 
primer where, under appropriate conditions, mismatch can 
prevent, or reduce polymerase extension (Prossner (1993) 
25 Tibtech 11:238). In addition, it may be desirable to 
introduce a novel restriction site in the region of the 
mutation to create cleavage-based detection (Gasparini et 
al. (1992) Mol. Cell Probes 6:1). It is anticipated that 
in certain embodiments amplification may also be 
3 0 performed using Taq ligase for amplification (Barany 

(1991) Proc. Natl. Acad. Sci USA 88:189). In such cases, 
ligation will occur only if there is a perfect match at 
the 3' end of the 5' sequence making it possible to 
detect the presence of a known mutation at a specific 
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site by looking for the presence or absence of 
amplification. 

The methods described herein may be performed, for 
example, by utilizing pre-packaged diagnostic kits 
5 comprising at least one probe nucleic acid or antibody 
rea g en t described herein, which may be conveniently used, 
e.g., in clinical settings to diagnose patients 
exhibiting symptoms or family history of a disease or 
illness involving a CARD-3 or CARD-4 gene. 

10 Furthermore, any cell type or tissue, preferably 

peripheral blood leukocytes, in which CARD-3 or CARD-4 is 
expressed may be utilized in the prognostic assays 
described herein. 

3 . Pharmacogenomics 

15 Agents, or modulators which have a stimulatory or 

inhibitory effect on CARD-3 or CARD-4 act ivity (e . g . , 
CARD-3 or CARD-4 gene expression) as identified by a 
screening assay described herein can be administered to 
individuals to treat (prophylactically or 

20 therapeutically) disorders (e.g., an immunological 
disorder) associated with aberrant CARD-3 or CARD-4 
activity. In conjunction with such treatment, the 
pharmacogenomics (i.e., the study of the relationship 
between an individual's genotype and that individual's 

2 5 response to a foreign compound or drug) of the individual 

may be considered. Differences in metabolism of 
therapeutics can lead to severe toxicity or therapeutic 
failure by altering the relation between dose and blood 
concentration of the pharmacologically active drug. Thus, 

3 0 the pharmacogenomics of the individual permits the 

selection of effective agents (e.g., drugs) for 
prophylactic or therapeutic treatments based on a 
consideration of the individual's genotype. Such 
pharmacogenomics can further be used to determine 
3 5 appropriate dosages and therapeutic regimens. 
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Accordingly, the activity of CARD - 3 or CARD - 4 protein, 
expression of CARD-3 or CARD-4 nucleic acid, or mutation 
content of CARD- 3 or CARD-4 genes in an individual can be 
determined to thereby select appropriate agent (s) for 
5 therapeutic or prophylactic treatment of the individual. 

Pharmacogenomics deals with clinically significant 
hereditary variations in the response to drugs due to 
altered drug disposition and abnormal action in affected 
persons. See, e.g., Linder (1997) Clin. Chem. 

10 43 (2 ): 2 54 -266 . In general, two types of pharmacogenet ic 
conditions can be differentiated. Genetic conditions 
transmitted as a single factor altering the way drugs act 
on the body (altered drug action) or genetic conditions 
transmitted as single factors altering the way the body 

15 acts on drugs (altered drug metabolism) . These 

pharmacogenetic conditions can occur either as rare 
defects or as polymorphisms. For example, 

glucose-6-phosphate dehydrogenase deficiency (G6PD) is a 
common inherited enzymopathy in which the main clinical 

20 complication is haemolysis after ingestion of oxidant 
drugs (anti-malarials , sulfonamides, analgesics, 
nitrofurans) and consumption of fava beans. 

As an illustrative embodiment, the activity of 
drug metabolizing enzymes is a major determinant of both 

25 the intensity and duration of drug action. The discovery 
of genetic polymorphisms of drug metabolizing enzymes 
(e.g., N-acetyltransf erase 2 (NAT 2) and cytochrome P450 
enzymes CYP2D6 and CYP2C19) has provided an explanation 
as to why some patients do not obtain the expected drug 

3 0 effects or show exaggerated drug response and serious 
toxicity after taking the standard and safe dose of a 
drug. These polymorphisms are expressed in two 
phenotypes in the population, the extensive metabolizer 
(EM) and poor metabolizer (PM) . The prevalence of PM is 

35 different among different populations. For example, the 
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gene coding for CYP2D6 is highly polymorphic and several 
mutations have been identified in PM, which all lead to 
the absence of functional CYP2D6 . Poor metabolizers of 
CYP2D6 and CYP2C19 quite frequently experience 
5 exaggerated drug response and side effects when they 
receive standard doses. If a metabolite is the active 
therapeutic moiety, PM show no therapeutic response, as 
demonstrated for the analgesic effect of codeine mediated 
by its CYP2D6-f ormed metabolite morphine. The other 

10 extreme are the so called ultra-rapid metabolizers who do 
not respond to standard doses. Recently, the molecular 
basis of ultra-rapid metabolism has been identified to be 
due to CYP2D6 gene amplification. 

Thus, the activity of CARD-3 or CARD-4 protein, 

15 expression of CARD-3 or CARD-4 nucleic a ci d, — or— mutation 
content of CARD-3 or CARD-4 genes in an individual can be 
determined to thereby select appropriate agent (s) for 
therapeutic or prophylactic treatment of the individual. 
In addition, pharmacogenetic studies can be used to apply 

20 genotyping of polymorphic alleles encoding 

drug-metabolizing enzymes to the identification of an 
individual's drug responsiveness phenotype . This 
knowledge, when applied to dosing or drug selection, can 
avoid adverse reactions or therapeutic failure and thus 

25 enhance therapeutic or prophylactic efficiency when 
treating a subject with a CARD-3 or CARD-4 modulator, 
such as a modulator identified by one of the exemplary 
screening assays described herein. 

4. Monitoring of Effects During Clinical Trials 

30 Monitoring the influence of agents (e.g., drugs, 

compounds) on the expression or activity of CARD-3 or 
CARD-4 (e.g., the ability to modulate aberrant cell 
proliferation and/or differentiation) can be applied not 
only in basic drug screening, but also in clinical 

35 trials. For example, the effectiveness of an agent 
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determined by a screening assay as described herein to 
increase CARD- 3 or CARD-4 gene expression, protein 
levels, or upregulate CARD-3 or CARD-4 activity, can be 
monitored in clinical trails of subjects exhibiting 
5 decreased CARD-3 or CARD-4 gene expression, protein 
levels, or downregulated CARD-3 or CARD-4 activity. 
Alternatively, the effectiveness of an agent determined 
by a screening assay to decrease CARD-3 or CARD-4 gene 
expression, protein levels, or downregulated CARD-3 or 

10 CARD-4 activity, can be monitored in clinical trials of 
subjects exhibiting increased CARD-3 or CARD-4 gene 
expression, protein levels, or upregulated CARD-3 or 
CARD-4 activity. In such clinical trials, the expression 
or activity of CARD-3 or CARD-4 and, preferably, other 

15 genes that have been implicated in, for example, a 

cellular proliferation disorder can be used as a "read 
out " or markers of the immune responsiveness of a 
particular cell. 

For example, and not by way of limitation, genes, 

20 including CARD-3 or CARD-4, that are modulated in cells 

by treatment with an agent (e.g., compound, drug or small 
molecule) which modulates CARD-3 or CARD-4 activity 
(e.g., identified in a screening assay as described 
herein) can be identified. Thus, to study the effect of 

25 agents on cellular proliferation disorders, for example, 
in a clinical trial, cells can be isolated and RNA 
prepared and analyzed for the levels of expression of 
CARD-3 or CARD-4 and other genes implicated in the 
disorder. The levels of gene expression (i.e., a gene 

30 expression pattern) can be quantified by Northern blot 

analysis or RT-PCR, as described herein, or alternatively 
by measuring the amount of protein produced, by one of 
the methods as described herein, or by measuring the 
levels of activity of CARD-3 or CARD-4 or other genes. 

35 In this way, the gene expression pattern can serve as a 
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marker, indicative of the physiological response of the 
cells to the agent. Accordingly, this response state may 
be determined before, and at various points during, 
treatment of the individual with the agent. 
5 In an embodiment, the present invention provides a 

method for monitoring the effectiveness of treatment of a 
subject with an agent (e.g., an agonist, antagonist, 
peptidomimetic , protein, peptide, nucleic acid, small 
molecule, or other drug candidate identified by the 

10 screening assays described herein) comprising the steps 
of (i) obtaining a pre-administration sample from a 
subject prior to administration of the agent; (ii) 
detecting the level of expression of a CARD -3 or CARD -4 
protein, mRNA, or genomic DNA in the preadministrat ion 

15 sample; (iii) obtaining one or more post-administration 
samples from the subject; (iv) detecting the level of 
expression or activity of the CARD-3 or CARD-4 protein, 
mRNA, or genomic DNA in the post -administration samples; 
(v) comparing the level of expression or activity of the 

20 CARD-3 or CARD-4 protein, mRNA, or genomic DNA in the 
pre-administration sample with the CARD-3 or CARD-4 
protein, mRNA, or genomic DNA in the post administration 
sample or samples; and (vi) altering the administration 
of the agent to the subject accordingly. For example, 

2 5 increased administration of the agent may be desirable to 
increase the expression or activity of CARD-3 or CARD-4 
to higher levels than detected, i . e . , to increase the 
effectiveness of the agent. Alternatively, decreased 
administration of the agent may be desirable to decrease 

30 expression or activity of CARD-3 or CARD-4 to lower 

levels than detected, i.e., to decrease the effectiveness 
of the agent . 



WO 99/40102 



PCT/US99/02544 



- 116 - 

C. Methods of Treatment 

The present invention provides for both 
prophylactic and therapeutic methods of treating a 
subject at risk of (or susceptible to) a disorder or 
5 having a disorder associated with aberrant CARD-3 or 
CARD-4 expression or activity . 

1. Prophylactic Methods 

In one aspect, the invention provides a method for 
preventing in a subject, a disease or condition 

10 associated with an aberrant CARD-3 or CARD-4 expression 
or activity, by administering to the subject an agent 
which modulates CARD-3 or CARD-4 expression or at least 
one CARD-3 or CARD-4 activity. Subjects at risk for a 
disease which is caused or contributed to by aberrant 

15 CARD-3 or CARD-4 expression or activity can be identified 
by, for example, any or a combination of diagnostic or 
prognostic assays as described herein. Administration of 
a prophylactic agent can occur prior to the manifestation 
of symptoms characteristic of the CARD-3 or CARD-4 

20 aberrancy, such that a disease or disorder is prevented 
or, alternatively, delayed in its progression. Depending 
on the type of CARD-3 or CARD-4 aberrancy, for example, a 
CARD-3 or CARD-4 agonist or CARD-3 or CARD-4 antagonist 
agent can be used for treating the subject. The 

2 5 appropriate agent can be determined based on screening 
assays described herein. Activities of CARD-3 or CARD-4 
that could be modulated for prophylactic purposes 
include, but are not limited to, 1) CARD-3 or CARD-4 gene 
or protein expression, for example, see Example 11 for a 

30 description of the mRNA expression pattern of human 

CARD-4; 2) CARD-3 or CARD-4 binding to a target protein, 
for example, see Examples 7, 8, and 12 for a description 
of proteins known to bind to CARD-3 or CARD-4; 3) CARD-4 
regulation of NF-kB as described in Example 9; and 4) 
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CARD- 3 and CARD-4 enhancement of caspase 9 activity as 
described in Example 10. 

2 . Therapeutic Methods 

Another aspect of the invention pertains to 
5 methods of modulating CARD-3 or CARD-4 expression or 

activity for therapeutic purposes. The modulatory method 
of the invention involves contacting a cell with an agent 
that modulates one or more of the activities of CARD-3 or 
CARD-4 protein activity associated with the cell . An 

10 agent that modulates CARD-3 or CARD-4 protein activity 
can be an agent as described herein, such as a nucleic 
acid or a protein, a naturally-occurring cognate ligand 
of a CARD-3 or CARD-4 protein, a peptide, a CARD-3 or 
CARD-4 peptidomimetic , or other small molecule. In one 

15 embodiment, the agent stimulates one or more of the 
biological activities of CARD-3 or CARD-4 protein. 
Examples of such stimulatory agents include active CARD-3 
or CARD-4 protein and a nucleic acid molecule encoding 
CARD-3 or CARD-4 that has been introduced into the cell. 

2 0 In another embodiment, the agent inhibits one or more of 

the biological activities of CARD-3 or CARD-4 protein. 
Examples of such inhibitory agents include antisense 
CARD-3 or CARD-4 nucleic acid molecules and anti -CARD-3 
or CARD-4 antibodies. These modulatory methods can be 

25 performed in vitro (e.g., by culturing the cell with the 
agent) or, alternatively, in vivo (e.g, by administering 
the agent to a subject) . As such, the present invention 
provides methods of treating an individual afflicted with 
a disease or disorder characterized by aberrant 

30 expression or activity of a CARD-3 or CARD-4 protein or 
nucleic acid molecule. In one embodiment, the method 
involves administering an agent (e.g., an agent 
identified by a screening assay described herein) , or 
combination of agents that modulates (e.g., upregulates 

3 5 or downregulates) CARD-3 or CARD-4 expression or 
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activity. In another embodiment, the method involves 
administering a CARD-3 or CARD-4 protein or nucleic acid 
molecule as therapy to compensate for reduced or aberrant 
CARD-3 or CARD-4 expression or activity. Activities of 
5 CARD-3 or CARD-4 that could be modulated for therapeutic 
purposes include, but are not limited to, 1) CARD-3 or 
CARD-4 gene or protein expression, for example, see 
Example 11 for a description of the mRNA expression 
pattern of human CARD-4; 2) CARD-3 or CARD-4 binding to a 

10 target protein, for example, see Examples 7, 8, and 12 

for a description of proteins known to bind to CARD-3 or 
CARD-4; 3) CARD-4 regulation of NF-kB as described in 
Example 9; and 4) CARD-4 enhancement of caspase 9 
activity as described in Example 10. 

15 Stimulation of CARD-3 or CARD-4 activity is 

desirable in situations in which CARD-3 or CARD-4 is 
abnormally downregulated and/or in which increased CARD-3 
or CARD-4 activity is likely to have a beneficial effect. 
Conversely, inhibition of CARD-3 or CARD-4 activity is 

20 desirable in situations in which CARD-3 or CARD-4 is 

abnormally upregulated, e.g., in myocardial infarction, 
and/or in which decreased CARD-3 or CARD-4 activity is 
likely to have a beneficial effect. Since CARD-4 may 
play be involved in the processing of cytokines, 

25 inhibiting the activity or expression CARD4- may be 

beneficial in patients that have aberrant inflammation. 

This invention is further illustrated by the 
following examples which should not be construed as 
limiting. The contents of all references, patents and 

30 published patent applications cited throughout this 
application are hereby incorporated by reference. 
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EXAMPLES 

Example 1: Isolation and Characterization of 
full- length Human CARD- 3 and CARD-4L/S cDNAs . 

A profile of known CARD domains was used to search 
5 databases of cDNA sequences and partial cDNA sequences 
using TBLASTN (Washington University; version 2.0, 
BLOSUM62 search matix) . This search led to the 
identification of CARD-3. Using CARD-3 to search 

databases of cDNA sequences and partial cDNA sequences, 

10 another potential CARD cDNA was found. This cDNA 
sequence was used screen a human umbilical vein 
endothelial library (HUVE) and a clone containing the 
partial CARD-4S was identified. The human umbilical vein 
endothelial library was then rescreened using a probe 

15 designed against the partial CARD- 4 S sequence and a clone 
containing the CARD-4L sequence was identified. 

Example 2; Characterization of CARD-3 AND 
CARD-4L/S Proteins. 

In this example, the predicted amino acid 

2 0 sequences of human CARD-3 and CARD-4L/S proteins were 
compared to amino acid sequences of known proteins and 
various motifs were identified. For example, the CARD 
domains of CARD-3 and CARD-4 were aligned (Figure 7) with 
the CARD domains of ARC-CARD (SEQ ID NO:31), cIAPl-CARD 

25 (SEQ ID NO: 32) and CIAP2-CARD (SEQ ID NO: 33) . In 

addition, the molecular weight of the human CARD-3 and 
CARD-4L/S proteins were predicted. 

The human CARD-3 cDNA was isolated as described 
above (Figure 1; SEQ ID NO:l) and encodes a 54 0 amino 

30 acid protein (Figure 2: SEQ ID NO:2). CARD-3 also 

includes one predicted kinase domain (amino acid 1 to 
amino acid 300 of SEQ ID NO:2; SEQ ID N0:4), which is 
followed by a predicted linker domain (amino acid 301 to 
amino acid 431 of SEQ ID NO: 2; SEQ ID NO: 5) and a 
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predicted CARD domain (amino acid 432 to amino acid 540 
of SEQ ID NO: 2; SEQ ID NO : 6 ) . 

The human CARD - 4 L cDNA was isolated as described 
above (Figure 3; SEQ ID NO: 7) and has a 2859 nucleotide 
5 open reading frame (nucleotides 245-3103 of SEQ ID NO : 7 ; 
SEQ ID NO: 9) which encodes a 953 amino acid protein 
(Figure 4; SEQ ID NO : 8 ) . CARD- 4 L. protein has a predicted 
CARD domain (amino acids 15-114; SEQ ID NO: 10) . CARD - 4 L 
is also predicted to have a nucleotide binding domain 

10 which extends from about amino acid 198 to about amino 

acid 3 97 of SEQ ID NO : 8 ; SEQ ID NO: 11, a predicted Walker 
Box "A", which extends from about amino acid 202 to about 
amino acid 209 of SEQ ID NO : 8 ; SEQ ID NO: 12, a predicted 
Walker Box "B" , which extends from about amino acid 280 

15 to about amino acid 284, of SEQ ID NO : 8 ; SEQ ID NO: 13, a 
predicted kinase la (P-loop) domain, which extends from 
about amino acid 197 to about amino acid 212 of SEQ ID 
NO: 8; SEQ ID NO: 46, a predicted kinase 2 domain, which 
extends from about amino acid 273 to about amino acid 288 

20 of SEQ ID NO: 8; SEQ ID NO : 4 7 , a predicted kinase 3a 

subdomain, which extends from about amino acid 32 7 to 
about amino acid 338 of SEQ ID NO : 8 ; SEQ ID NO: 14, ten 
predicted Leucine-rich repeats which extend from about 
amino acid 674 to about amino acid 950 of SEQ ID NO: 8. 

25 The first Leucine-rich repeat is predicted to extend from 
about amino acid 674 to about amino acid 701 of SEQ ID 
NO: 8; SEQ ID NO: 15. The second Leucine-rich repeat is 
predicted to extend from about amino acid 702 to about 
amino acid 72 7 of SEQ ID NO: 8; SEQ ID NO: 16. The third 

30 Leucine-rich repeat is predicted to extend from about 
amino acid 72 8 to about amino acid 754 of SEQ ID NO : 8 ; 
SEQ ID NO: 17. The fourth Leucine-rich repeat is 
predicted to extend from about amino acid 755 to about 
amino acid 782 of SEQ ID NO: 8; SEQ ID NO: 18. The fifth 

35 Leucine-rich repeat is predicted to extend from about 
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amino acid 783 to about amino acid 810 of SEQ ID NO : 8 ; 
SEQ ID NO: 19. The sixth Leucine-rich repeat is predicted 
to extend from about amino acid 811 to about amino acid 
83 8 of SEQ ID NO : 8 ; SEQ ID NO: 20. The seventh 
5 Leucine-rich repeat is predicted to extend from about 
amino acid 83 9 to about amino acid 866 of SEQ ID NO: 8; 
SEQ ID NO: 21. The eighth Leucine-rich repeat is 
predicted to extend from about amino acid 867 to about 
amino acid 894 of SEQ ID NO : 8 ; SEQ ID NO: 22. The ninth 

10 Leucine-rich repeat is predicted to extend from about 
amino acid 895 to about amino acid 922 of SEQ ID NO : 8 ; 
SEQ ID NO: 23 and the tenth leucine-rich repeat is 
predicted to extend from about amino acid 92 3 to about 
amino acid 950 of SEQ ID NO: 8; SEQ ID NO: 24. 

15 The human partial CARD - 4 S cDNA isolated as 

described above (Figure 5; SEQ ID NO:25) encodes a 490 
amino acid protein (Figure 6; SEQ ID NO: 26) . CARD- 4 S 
includes one predicted partial CARD domain (amino acids 
1-74 of SEQ ID NO:26). CARD - 4 S is also predicted to have 

2 0 a P-Loop which extends from about amino acid 163 to about 

amino acid 17 0 of SEQ ID NO: 26; SEQ ID NO: 29, and a 
predicted Walker Box "B ,! which extends form about amino 
acid 241 to about amino acid 245 of SEQ ID NO: 26; SEQ ID 
NO : 3 0 . 

25 A plot showing the predicted structural features 

of CARD-4L is presented in Figure 8. This figure shows 
the predicted alpha regions (Gamier- Robinson and 
Chou-Fasman) , the predicted beta regions 
(Gamier-Robinson and Chou-Fasman) , the predicted turn 

3 0 regions (Garnier-Robinson and Chou-Fasman) and the 

predicted coil regions (Garnier-Robinson and 
Chou-Fasman) . Also included in the figure is a 
hydrophilicity plot (Kyte-Doolittle) , the predicted alpha 
and beta-amphatic regions (Eisenberg) , the predicted 
35 flexible regions (Karplus-Schulz) , the predicted 
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antigenic index (Jameson-Wolf) and the predicted surface 
probability plot (Emini) . 

A plot showing the predicted sturctural features 
of CARD - 4 S is also presented in Figure 9. This figure 
5 shows the predicted alpha regions (Garnier-Robinson and 
Chou-Fasman) , the predicted beta regions 

(Garnier-Robinson and Chou-Fasman) , the predicted turn 
regions (Garnier-Robinson and Chou-Fasman) and the 
predicted coil regions (Garnier-Robinson and 
10 Chou-Fasman) . Also included in the figure is a 

hydrophilicity plot (Kyte-Doolittle) , the predicted alpha 
and beta-amphatic regions (Eisenberg) , the predicted 
flexible regions (Karplus-Schulz) , the predicted 
antigenic index (Jameson-Wolf) and the predicted surface 
15 probability plot (Emini) . 

The predicted MW of CARD- 3 is approximately 61 
kDa. The predicted MW of CARD- 4 L is approximately 10 8 
kDa . 

Example 3: Preparation of CARD -3 and CARD -4 Proteins 

Recombinant CARD- 3 and CARD- 4 can be produced in a 
variety of expression systems. For example, the CARD- 3 
and CARD- 4 peptides can be expressed as a recombinant 
glutathione-S-transf erase (GST) fusion protein in E. coli 
and the fusion protein can be isolated and characterized. 
Specifically, as described above, CARD-3 or CARD-4 can be 
fused to GST and the fusion protein can be expressed in 
E. coli strain PEB199. Expression of the GST-CARD-3 or 
GST-CARD-4 fusion protein in PEB199 can be induced with 
IPTG. The recombinant fusion protein can be purified 
from crude bacterial lysates of the induced PEB199 strain 
by affinity chromatography on glutathione beads. 



25 
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Example 4: Identification of splice variants of CARD 4 . 

The 5' untranslated sequence from CARD - 4 L, was used 
to search databases of cDNA sequences and partial cDNA 
sequences using BLASTN (Washington University; version 
5 2.0, BL.OSUM62 search matrix) for additional CARD-4 cDNA 
clones. This search led to the identification of two 
cDNA clones, clone Z from a human lymph node library and 
the Y clone from a human brain cDNA library. Both clones 
were sequenced and found to represent probable splice 

10 variants of CARD-4 that encode truncated CARD-4 proteins, 
Y encoding a 24 9 amino acid protein and Z encoding a 164 
amino acid protein. Fig. 10 shows the nucleotide (SEQ ID 
NO: 38) and Fig. 11 the predicted amino acid (SEQ ID 
NO: 39) sequences of human CARD-4 Y; Fig. 12 shows the 

15 nucleotide (SEQ ID NO:40) and Fig. 13 the amino acid (SEQ 
ID NO: 4 1) sequences of human CARD - 4 Z ; and Fig. 14 shows 
an alignment of the CARD - 4 L , CARD-4 Y, and CARD-4 Z amino 
acid sequences generated by the Clustal program using a 
PAM250 residue weight table. 

2 0 Example 5: Identification of murine CARD-4. 

The CARD-4 polypeptide sequence was used to search 
databases of cDNA sequences and partial cDNA sequences 
using the TBIiASTN program (version 1.4, BLOSUM62 search 
matrix, and a word length of 3) for murine CARD-4 cDNA 

25 clones. This search led to the identification of a 

partial murine CARD-4 clone designated murine CARD-4L. 
The rapid identification of cDNA ends procedure (RACE) 
was applied to the 5' end of the murine CARD- 4 L clone to 
elucidate the 5' end of the murine CARD-4L cDNA . Fig. 15 

30 shows the murine CARD-4L nucleotide sequence (SEQ ID 
NO: 42) , Figure 16 shows the murine CARD-4L amino acid 
sequence (SEQ ID NO:43), and Fig. 17 shows an alignment 
of the murine CARD-4L and human CARD-4L amino acid 
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sequences generated by the Clustal program using a PAM250 
residue weight table. 

Example 6: Identification of the chromosomal location 
of human CARD -4 . 

5 To determine the chromosomal location of the human 

CARD-4 gene, the polymerase chain reaction carried out 
with human CARD-4 -specif ic primers card4t, with the 5' to 
3' sequence agaaggt ctggtcggcaaa (SEQ ID NO: 44), and 
card4k, with the 5' to 3' sequence aagccctgagtggaagca 
10 ( SEQ ID NO: 45), was used to screen DNAs from a 

commercially available somatic cell hybrid panel . This 
analysis showed that human CARD-4 maps to chromosome 7 
close to the SHGC-31928 genetic marker. 



Example 7: Identification of CARD-3 in a yeast 
15 two-hybrid screen for proteins that physically interact 
with the CARD domain of human CARD-4. 

DNA encoding amino acids 1-145 of human CARD-4 
comprising the CARD domain was cloned into a yeast 
two-hybrid screening vector to create a CARD -4,1-145- GAL 4 

2 0 DNA-binding domain fusion for two-hybrid screening. The 
CARD-4 , 1-145-GAL4 DNA-binding domain fusion was used to 
screen human mammary gland and human prostate two-hybrid 
libraries for gene products that could physically 
associate with CARD-4 , 1 - 145 . Twelve library plasmids 

25 expressing CARD 4 , 1-145 interacting proteins were found to 
contain the CARD-domain containing protein CARD-3 thus 
establishing a direct or indirect physical interaction 
between CARD-4 and CARD-3. 

In addition, DNA encoding amino acids 435-540 of 

30 CARD-3 comprising the CARD domain of CARD-3 (SEQ ID NO: 6) 
was cloned into a yeast two-hybrid GAL 4 transcriptional 
activation domain fusion vector to create a CARD-3 ,435- 
540-GAL4 transcriptional activation domain fusion. To 



OKicrwv~irv ^\ur\ 



WO 99/40102 



PCT/US99/02544 



- 125 - 

test whether the CARD domain of CARD- 3 binds 
CARD-4 , 1-145 , the CARD- 3 , 43 5 - 54 0 -GAL4 transcriptional 
activation domain fusion expression vector and the 
CARD - 4,1-145 - GAL4 DNA-binding domain fusion vector were 
5 cotransf ormed into a two-hybrid screening Saccharomyces 
cerevisiae (yeast) strain. The resulting cotransf ormed 
yeast strain expressed the two reporter genes that 
indicate a physical interaction between the two hybrid 
proteins in the experiment, in this case, the CARD- 3, 43 5- 
10 540-GAL4 transcriptional activation domain fusion protein 
and the CARD-4 , 1-14 5-GAL4 DNA-binding domain fusion 
protein. This experiment established a physical 
interaction between the CARD domain of CARD-3 and the 
CARD domain of CARD-4. 

15 Example 8: Identification of hNUDC in a yeast 

two-hybrid screen for proteins that physically interact 
with the LRR domain of human CARD-4. 

DNA encoding amino acids 4 06-953 of human CARD- 4 L 
comprising the LRR domain was cloned into a yeast 

2 0 two-hybrid screening vector to create a 

CARD-4 , 4 06-953-GAL4 DNA-binding domain fusion for 
two-hybrid screening. The CARD-4 , 4 06- 953-GAL4 
DNA-binding domain fusion was used to screen a human 
mammary gland two-hybrid library for gene products that 

25 could physically associate with CARD-4 , 406- 953 . One 

library plasmid expressing a CARD4 , 406 - 953 interacting 
protein was found to contain the hNUDC protein, the human 
ortholog of the rat NUDC protein that has been implicated 
in nuclear movement (Morris et al . , Curr . Biol, 8:603 

30 [1998], Morris et al . , Exp. Cell Res. 238:23 [1998]), 

thus establishing a physical interaction between CARD-4 
and hNUDC. 
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Example 9: Discovery of regulation by CARD -4 of NF-kB. 

The first group of experiments described in this 
Example were carried out to determine if CARD-4 can 
activate the NF-/cB pathway. CARD-4 regulation of the 
NF-kB pathway is of interest because the NF-kB pathway is 
involved in many diseases described in (New England 
Journal of Medicine 336:1066 [1997]) and (American 
Journal of Cardiology 76:18C [1995]) and other references 
known to those skilled in the art. Participation of 
CARD-4 in the NF-kB pathway would make CARD-4 an 
attractive target for drugs that modulate the NF- kB 
pathway for treatment of NF-kB pathway-dependent 
diseases, conditions, and biological processes. 

The first group of experiments showed specific 
CARD-4 -mediated NF-kB pathway induction. 

The second group of experiments described in this 
Example were carried out to determine if CARD-3, the NIK 
serine/threonine protein kinase (Su et al . , EMBO J. 
16:1279 [1997]), or the signal transduction protein TRAF6 
(Cao et al., Nature 383:443 [1996]), proteins known to 
participate in the induction of NF-kB (McCarthy et al . , 
J. Biol. Chem. 273:16968 [1998]), are involved in 
transducing the CARD-4 -dependent NF-kB pathway induction 
signal. It was found that CARD-3, NIK, and TRAF6 are all 
involved in transducing the CARD-4 -mediated NF- kB pathway 
induction signal. 

In nine transfection experiments, 293T cells 
coexpressing an NF-kB reporter plasmid and either pCI , 
pCI-CARD-4L (expressing CARD-4L) , pCI -CARD- 4 S (expressing 
CARD-4S) , pCI-APAFL (expressing Apaf-1), pCI-APAFS 
(expressing an Apaf-1 variant lacking WD repeats) , 
pCI-CARD-4LnoCARD (expressing CARD - 4 L without a CARD 
domain) , pCI -CARD4LnoLRR (expressing CARD-4L without a 
LRR) , pCI-CARD4LCARDonly (expressing CARD - 4 L, CARD domain 
only) , or pCI - CARD4NBSonly (expressing CARD-4L nucleotide 
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binding sequence only) were created. 2 93T cells cells 
were plated in 6-well plates (35 mm wells) and 
transfected 2 days later (90% confluency) with 1 fig of 
NF-kB lucif erase reporter plasmid (pNF- kB-Luc , 
5 Stratagene) , 200 ng of pCMV fi-gal, 600 ng of pCI vector 
and 200 ng of indicated expression plasmids using 
SuperFect transfection reagent (Qiagen) . For dominant- 
negative experiments, 2 ng of CARD4 expressing plasmid 
and 800 ng of dominant -negative plasmid were used. Cells 

10 were harvested 48 h after transfection and luciferase 
activity in 1000-fold diluted cell extracts was 
determined using the Luciferase Assay System (Promega) . 
In addition, 6-galactosidase activities were determined 
and used to normalize transfection efficiency. 

15 Relative luciferase activity was determined at the 

end of the experiment to assess NF-kB pathway activation 
by the gene expressed by the pCI -based plasmid in each 
transfected cell line. The cell lines containing pCI , 
pCI-APAFS, pCI-APAFL, pC I - CARD - 4 LnoCARD , and pCI- 

2 0 CARD4NBSonly had similar baseline levels of luciferase 

expression but the cell lines containing pCI -CARD-4L, 
pCI-CARD4LnoLRR / and pCI -CARD4LCARDonly had luciferase 
expression about nine fold elevated relative to baseline 
and the cell line containing pCI-CARD4S had luciferase 
25 expression sixteen fold elevated relative to baseline. 

This result demonstrates induction by CARD-4S and CARD-4L 
of the NF-kB pathway. This CARD- 4 mediated NF-kB pathway 
induction is dependent on the CARD-4 CARD domain because 
the pCI -CARD-4noCARD construct expressing CARD-4 lacking 

3 0 its CARD domain did not induce the luciferase reporter 

gene and pCI -CARD4LCARDonly expressing the CARD-4 CARD 
domain did induce the luciferase reporter gene. Also, 
the CARD-4 LRR domains are not required for NF-kB pathway 
activation because pCI -CARD4LnoLRR expressing a CARD-4 
35 mutant protein lacking LRR domains is able to induce the 
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lucif erase reporter gene. In addition, the CARD-4 NBS 
domain is not sufficient for NF-kB pathway activation 
because pCI -CARD4NBSonly expressing CARD-4 NBS domain is 
not able to induce the lucif erase reporter gene. In 
5 addition, the induction of the NF-kB pathway by CARD-4 is 
specific, as neither Apaf -expressing construct in this 
experiment induced lucif erase activation. 

In five transfection experiments, 293T cells 
coexpressing an NF-*B reporter plasmid (NF- kB- lucif erase , 

10 Stratagene) and pCI-CARD-4L. and either, no vector, 

pCI-TRAF6-DN (expressing a dominant negative version of 
TRAF-6) , pCI-NIK-DN (expressing a dominant negative 
version of NIK kinase) , pCI -CARD 3 CARDonly (expressing the 
CARD domain of CARD- 3, which acts as a dominant negative 

15 version of CARD-3), or pCI-Bcl-XL (expressing the anti- 
apoptotic protein Bcl-XL) were created. TRAF6-DN, NIK- 
DN, and CARD3 -CARDonly are dominant negative alleles of 
the TRAF6, NIK, and CARD 3 genes, respectively. After 48 
hours, cells were lysed and the relative lucif erase 

20 activity was determined (Promega Kit) to assess NF-eB 
pathway activation by the genes expressed by the one or 
two pCI -based plasmids in each transfected cell line. The 
cell lines containing pCI - CARD - 4 L> only or pCI - CARD - 4 L. and 
pCI-Bcl-XL had relative lucif erase reporter gene 

25 expression of about 18 units. The cell lines containing 
pCI -CARD-4L and pCI -TRAF6 -DN, pCI-CARD-4L and pCI-NIK-DN, 
or pCI - CARD - 4 L and pCI -CARD3 CARDonly had relative 
lucif erase reporter gene expression of about 4 units. 
Inhibition of CARD - 4 L- mediated NF-kB pathway induction by 

30 TRAF6-DN, NIK-DN, and CARD- 3 CARDonly is specific as Bcl- 
XL did not inhibit CARD-4L-mediated NF-*B pathway 
induction . 

These results demonstrate that dominant negative 
alleles of TRAF6, NIK and CARD-3 expressed, respectively, 
3 5 from pCI -TRAF6 -DN, pCI-NIK-DN, and pCI -CARD3 CARDonly 
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block induction of the NF-kB reporter gene by CARD-4L 
expression (pCI -CARD-4L) and suggest that TRAF6, NIK, and 
CARD-3 act downstream of CARD- 4 L to transduce the CARD - 4 L 
NF-kB pathway induction stimulus. 
5 In an additional experiment, coexpression of CARD- 

4 and the CARD domain of RICK revealed that the CARD 
domain of RICK functions as a dominant negative mutant 
suggesting that RICK is a downstream mediator of CARD-4 
function. 

10 Example 10: Discovery of CARD -4 enhancement of caspase 
9 activity. 

In ten transfection experiments, 2 93T cells 
coexpressing a beta galactosidase-expressing plasmid 
(pCMV /3-gal from Stratagene) as a marker for viable cells 

15 and either pCI, pCI-CARD-3, pCI-APAF, pCI-CARD-4L, 

pCI-CARD-4S, pC I -CARD4 LnoLRR, pCI -CARD4NBSonly , pCI- 
CARD4 LCARDonly , pCI -CARD-4L.no CARD or pCI-casp9 
(expressing caspase-9) were created- Transf ect ions 
included 400 ng of pCMV /3-gal, 800 ng of expression 

20 plasmid, and Superfect transfection reagent from Qiagen 
and were carried out according to the manufacturer's 
directions. After 40-48 hours, cells were fixed and 
stained for beta-galactosidase expression and cell 
viability was determined by counting the number of beta 

25 galactosidase positive cells. Expression of pCI , pCI- 
CARD-3, pCI-APAF, pCI-CARD-4L, pCI-CARD-4S, pCI- 
CARD4 LnoLRR , pCI -CARD4NBSonly , pC I -CARD4 LCARDonly , and 
pC I - CARD - 4 LnoCARD did not result in loss of cell 
viability. As expected, expression of pCI-casp9 in 293T 

30 cells resulted in a loss of viability of about 75% of the 
cells in the experiment. 

It was next tested whether pCI , pC I -CARD- 3 , 
pCI-APAF, pCI -CARD-4L , pCI-CARD-4S, pC I - CARD4 LnoLRR , pCI- 
CARD4NBSonly, pCI -CARD4 LCARDonly , or pCI -CARD-4LnoCARD 
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would regulate caspase 9-mediated apoptosis. In nine 
transfection experiments, 293T cells coexpressing a beta 
galactosidase-expressing plasmid as a marker for viable 
cells, pCI-casp9, and either pCI , pCI-CARD-3, pCI-APAF, 
5 pCI -CARD-4L, pC I - CARD - 4 S , pCI - CARD4LnoLRR , pCI - 

CARD4NBSonly / pCI -CARD4LCARDonly , and p C I - CARD - 4 L no CARD 
were created. After 40-48 hours, cells were fixed and 
stained for beta-galactosidase expression and cell 
viability was determined by counting the number of beta 

10 galactosidase positive cells. Expression of pCI , 

pCI-CARD-4LnoCARD, and pCI -CARD4NBSonly in the caspase 
9-expressing 2 93T cells had no effect on the caspase 
9-induced apoptosis. However, pCI-CARD-3, pCI -CARD- 4L , 
pCI-CARD-4S, pCI-CARD4LnoLRR, pCI -CARD4L CARD only and, as 

15 expected, pCI-APAF enhanced the level of caspase 

9-induced apoptosis to 20 or less beta galactosidase 
positive cells per experiment from about 100 beta 
glactosidase positive cells per experiment. 

This experiment demonstrated that CARD- 4 can 

20 enhance caspase 9-mediated apoptosis because coexpression 
of CARD- 4 L or CARD- 4 S with caspase- 9 dramatically 
increases caspase- 9 mediated apoptosis. Furthermore, the 
CARD-4 CARD domain (SEQ ID NO: 10) is necessary and 
sufficient for CARD- 4 -mediated enhancement of caspase-9- 

25 potentiated apoptosis because CARD-4L lacking its CARD 
domain (pCI -CARD-4LjnoCARD) does not enhance caspase- 9- 
mediated apoptosis while the CARD-4 CARD domain expressed 
alone (pCI -CARD4LCARDonly) does induce caspase- 9 mediated 
apoptosis. In addition, the LRR present in CARD-4 is not 

3 0 required for CARD-4 enhancement of caspase- 9-mediated 

apoptosis because expression of a CARD-4 protein lacking 
the LRR ( pC I - CARD4 LnoLRR ) still enhances caspase-9- 
mediated apoptosis. The CARD-4 NBS is not sufficient for 
CARD-4 enhancement of caspase- 9 -mediated apoptosis 

35 because expression of the CARD-4 NBS only (pCI- 
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CARD4NBSonly) does not enhance caspase-9 mediated 
apoptosis. This experiment also demonstrates that CARD-3 
can enhance caspase- 9-mediated apoptosis. 

As detailed below in Example 12, CARD-4 does not 
5 appear to interact directly with caspase-9, suggesting 
that potentiation of caspase-9 activity by CARD-4 is 
mediated by activation of downstream pathways. 

Example 11: Identification and tissue distribution of 
mRNA species expressed by the human CARD-4 gene . 

10 Northern analysis of mRNAs extracted from adult 

human tissues revealed a 4.6 kilobase mRNA band that was 
expressed in most tissues examined. Highest expression 
was observed heart, spleen, placenta and lung. CARD-4 was 
also observed to be expressed in fetal brain, lung, liver 

15 and kidney. Cancer cell lines expressing the 4.6 kilobase 
CARD-4 mRNA include HeLa, K562, Molt4, SW480, A549 and 
melanoma. A larger 6.5 to 7.0 kilobase CARD-4 mRNA was 
expressed in heart, spleen, lung, fetal lung, fetal 
liver, and in the Molt4 and SW480 cell lines. 

20 Example 12: Physical association of CARD-4 with CARD-3. 

CARD-4-specif ic PCR primers with the 3' primer 
encoding the HA epitope tag were used to amplify the 
CARD- 4 L gene epitope tagged with HA and this PCR product 
was cloned into the mammalian expression vector pCl . 

2 5 CARD-3 -specif ic PCR primers with the 5' primer encoding 
the MYC epitope tag were used to amplify the CARD-3 gene 
epitope tagged with MYC and this PCR product was cloned 
into the mammalian expression vector pCI . CARD-3-specif ic 
PCR primers with the 5' primer encoding the MYC epitope 

30 tag were used to amplify the CARD-3 gene lacking the CARD 
domain (SEQ ID NO: 6) epitope tagged with MYC and this PCR 
product was cloned into the mammalian expression vector 
pCI . Caspase 9-specific PCR primers with the 3' primer 
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encoding the MYC epitope tag were used to amplify the 
caspase 9 gene epitope tagged with MYC and this PCR 
product was cloned into the mammalian expression vector 
pCI . In three transfection experiments, 293T cells 
5 coexpressing pCI -CARD-4LcHA and either pCI -CARD3nMYC , 
pCI-CARD3noCARDnMYC, or pCI -casp9cMYC were created. 
Cells from each transfected line were lysed and an 
immunoprecipitation procedure was carried out on each 
lysate with an anti-MYC epitope tag antibody to 

10 precipitate the CARD- 4LcHA expressed by each cell line 
and any physically associated proteins. 
Immunoprecipitated proteins were separated by 
electrophoresis on denaturing polyacrylamide gels, 
transferred to nylon filters, and probed with an ant i -HA 

15 epitope tag antibody in a Western blotting experiment to 
determine whether the MYC- tagged protein that was 
coexpressed with the CARD - 4 LcHA protein had 
coimmunoprecipitated with the CARD-4LcHA protein. In 
this experiment, CARD- 3 was found to coimmunoprecipitate 

20 with CARD-4 while CARD-3 lacking its CARD domain and 

caspase-9 did not coimmunoprecipitate with CARD-4. This 
experiment demonstrates that CARD-4 and CARD-3 physically 
associate and that CARD-3 requires its CARD domain to 
associate with CARD-4. In addition, CARD-4 appears to 

25 not associate with caspase-9. 

Example 13: CARD-4 Genomic Sequence 

Figure 18 is depicts the 32042 nucleotide genomic 
sequence of CARD-4. This sequence is based the CARD-4 
cDNA sequence described above and a BAC sequence (DBEST 
3 0 Accession No. AC006027) . The CARD-4 cDNA sequence 

described above was used to correct three errors in the 
BAC sequence, including one error resulting in a 
frameshift. The CARD-4 genomic sequence of Figure 18 
includes the following introns and exons : exon 1: 
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nucleotides 364-685, encoding amino acids 1-67 (start 
codon at nucleotides 485-487); intron 1: nucleotides 
686-2094; exon 2: nucleotides 2095-2269, encoding amino 
acids 67-126; intron 2: nucleotides 2270-4365; exon 3: 
5 nucleotides 366-6190, encoding amino acids 126-734; 
intron 3: nucleotides 6191-9024; exon 4: nucleotides 
9025-9108, encoding amino acids 734-762; intron 4: 
nucleotides 9109-10355; exon 5: nucleotides 10356-10439, 
encoding amino acids 762-790; intron 5: nucleotides 

10 10440-11181; exon 6: nucleotides 1182-11265, encoding 
amino acids 790-818; intron 6: nucleotides 11266-19749; 
exon 7: nucleotides 19750-19833, encoding amino acids 
818-846; intron 7: nucleotides 19834-21324; exon 8: 
nucleotides 21325-21408, encoding amino acids 846-874; 

15 intron 8: nucleotides 21409-24226; exon 9: nucleotides 
24227-24310, amino acids 874-903; intron 9: nucleotides 
24311-27948; exon 10: nucleotides 27949-28032, amino 
acids 903-930; intron 10: nucleotides 28033-31695; exon 
11: nucleotides 31696-32024, encoding amino acids 930-953 

20 (stop codon at nucleotides 31766-31768) . 

The introns in the CARD-4 genomic sequence contain 
consensus splice donor and acceptor sites (Molecular Cell 
Biology, Darnell et al . , eds.,1996). The CARD-4 genomic 
sequence is usful for genetic identification and mapping 

25 and identifying mutations, e.g., mutations is splice 
donor or splice acceptor sites. 

Equivalents 

Those skilled in the art will recognize, or be 
able to ascertain using no more than routine 
3 0 experimentation, many equivalents to the specific 

embodiments of the invention described herein. Such 
equivalents are intended to be encompassed by the 
following claims. 
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What is claimed is: 

1 . An isolated nucleic acid molecule selected 
from the group consisting of: 

a) a nucleic acid comprising the nucleotide 
5 sequence of SEQ ID NO: 38 or a complement thereof; 

b) a nucleic acid comprising the nucleotide 
sequence of SEQ ID NO: 40 or a complement thereof; 

c) a nucleic acid comprising the nucleotide 
sequence of SEQ ID NO: 42, or a complement thereof; 

10 d) a nucleic acid molecule which encodes a 

polypeptide comprising the amino acid sequence of SEQ ID 
NO:38; 

e) a nucleic acid molecule which encodes a 
polypeptide comprising the amino acid sequence of SEQ ID 

15 NO:40; 

f) a nucleic acid molecule which encodes a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO : 4 2 ; 

g) a nucleic acid comprising the nucleotide 

2 0 sequence of SEQ ID NO: 38 or a complement thereof; 

h) a nucleic acid consisting of the nucleotide 
sequence of SEQ ID NO: 40 or a complement thereof; 

i) a nucleic acid consisting of the nucleotide 
sequence of SEQ ID NO: 42, or a complement thereof; 

25 j) a nucleic acid molecule which encodes a 

polypeptide consisting of the amino acid sequence of SEQ 
ID NO: 38; 

k) a nucleic acid molecule which encodes a 
polypeptide consisting of the amino acid sequence of SEQ 

3 0 ID NO: 40; and 

1) a nucleic acid molecule which encodes a 
polypeptide consisting of the amino acid sequence of SEQ 
ID NO: 42 . 
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2 . A host cell which contains one of the nucleic 
acid molecules of claim 1 . 

3 . An isolated polypeptide selected from the 
group consisting of: 
5 a) a polypeptide comprising the amino acid 

sequence of SEQ ID NO: 39; 

b) a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 41; 

c) a polypeptide comprising the amino acid 
10 sequence of SEQ ID NO: 43; 

d) a polypeptide consisting of the amino acid 
sequence of SEQ ID NO: 39; 

e) a polypeptide consisting of the amino acid 
sequence of SEQ ID NO: 41; 

15 f) a polypeptide consisting of the amino acid 

sequence of SEQ ID NO: 43; 

g) a fragment of a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 39, wherein the fragment 
comprises at least 15 contiguous amino acids of SEQ ID 

2 0 NO : 3 9 ; 

h) a fragment of a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 41, wherein the fragment 
comprises at least 15 contiguous amino acids of SEQ ID 
NO : 4 1 ; 

2 5 i) a fragment of a polypeptide comprising the 

amino acid sequence of SEQ ID NO:43, wherein the fragment 
comprises at least 15 contiguous amino acids of SEQ ID 
NO: 43 ; 

j) a naturally occurring allelic variant of a 

3 0 polypeptide consisting of the amino acid sequence of SEQ 

ID NO: 39, wherein the polypeptide is encoded by a nucleic 
acid molecule which hybridizes to a nucleic acid molecule 
consisting of SEQ ID NO: 38 under stringent conditions; 
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k) a naturally occurring allelic variant of a 
polypeptide consisting of the amino acid sequence of SEQ 
ID NO: 41, wherein the polypeptide is encoded by a nucleic 
acid molecule which hybridizes to a nucleic acid molecule 
5 consisting of SEQ ID NO: 40 under stringent conditions; 
and 

1) a naturally occurring allelic variant of a 
polypeptide consisting of the amino acid sequence of SEQ 
ID NO: 43, wherein the polypeptide is encoded by a nucleic 
10 acid molecule which hybridizes to a nucleic acid molecule 
consisting of SEQ ID NO: 42 under stringent conditions. 

4. An antibody which selectively binds to any 
one of the polypeptides of claim 3. 

5. A method for producing a polypeptide selected 
15 from the group consisting of: 

a) a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 39; 

b) a polypeptide comprising the amino acid 
sequence of SEQ ID NO:41; 

20 c) a polypeptide comprising the amino acid 

sequence of SEQ ID NO: 43; 

d) a polypeptide consisting of the amino acid 
sequence of SEQ ID NO: 39; 

e) a polypeptide consisting of the amino acid 
25 sequence of SEQ ID NO: 41; 

f) a polypeptide consisting of the amino acid 
sequence of SEQ ID NO: 43; 

g) a fragment of a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 39, wherein the fragment 

30 comprises at least 15 contiguous amino acids of SEQ ID 
NO : 3 9 ; 

h) a fragment of a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 41, wherein the fragment 
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comprises at least 15 contiguous amino acids of SEQ ID 
NO: 41; 

i) a fragment of a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 43, wherein the fragment 
5 comprises at least 15 contiguous amino acids of SEQ ID 
NO: 43 / 

j ) a naturally occurring allelic variant of a 
polypeptide consisting of the amino acid sequence of SEQ 
ID NO: 39, wherein the polypeptide is encoded by a nucleic 

10 acid molecule which hybridizes to a nucleic acid molecule 
consisting of SEQ ID NO: 38 under stringent conditions; 

k) a naturally occurring allelic variant of a 
polypeptide consisting of the amino acid sequence of SEQ 
ID NO: 41, wherein the polypeptide is encoded by a nucleic 

15 acid molecule which hybridizes to a nucleic acid molecule 
consisting of SEQ ID NO: 40 under stringent conditions; 
and 

1) a naturally occurring allelic variant of a 
polypeptide consisting of the amino acid sequence of SEQ 
20 ID NO: 43, wherein the polypeptide is encoded by a nucleic 
acid molecule which hybridizes to a nucleic acid molecule 
consisting of SEQ ID NO: 42 under stringent conditions; 

comprising the step of culturing the host cell of 
claim 2 under conditions in which the nucleic acid 
25 molecule is expressed. 

6 . A method for detecting the presence of a 
polypeptide of claim 2 in a sample, comprising: 

a) contacting the sample with a compound which 
selectively binds to a polypeptide of claim 2; and 
3 0 b) determining whether the compound binds to the 

polypeptide of claim 2 in the sample. 
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7. A method for detecting the presence of a 
nucleic acid molecule of claim 1 in a sample, comprising 
the steps of : 

a) contacting the sample with a nucleic acid probe 
5 or primer which selectively hybridizes to the nucleic 

acid molecule; and 

b) determining whether the nucleic acid probe or 
primer binds to a nucleic acid molecule in the sample. 



8. A method for identifying a compound which 
10 binds to a polypeptide of claim 2 comprising the steps 

of: 

a) contacting a polypeptide, or a cell 
expressing a polypeptide of claim 2 with a test compound; 
and 

15 b) determining whether the polypeptide binds to 

the test compound. 

9. The method of claim 8, wherein the binding of 
the test compound to the polypeptide is detected by a 
method selected from the group consisting of: 

20 a) detection of binding by direct detecting of 

test compound/polypeptide binding; 

b) detection of binding using a competition 
binding assay; and 

c) detection of binding using an assay for 
25 CARD- 4 L or CARD - 4 S mediated signal transduction. 



10. A method for modulating the activity of a 
polypeptide of claim 2 comprising contacting a 
polypeptide or a cell expressing a polypeptide of claim 2 
with a compound which binds to the polypeptide in a 
30 sufficient concentration to modulate the activity of the 
polypeptide . 
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11- A method for identifying a compound which 
modulates the activity of a polypeptide of claim 2, 
comprising : 

a) contacting a polypeptide of claim 2 with a 
5 test compound; and 

b) determining the effect of the test compound 
on the activity of the polypeptide to thereby identify a 
compound which modulates the activity of the polypeptide. 

12. A method for identifying a compound that 
10 blocks the interaction between a CARD-4 protein 

comprising a CARD-4 domain and a CARD-4-interacting 
protein comprising the steps of: 

a) incubating said CARD-4 protein and said 
interactor in the presence and absence of a test agent; 
15 b) determining whether said test agent reduces the 

binding of said CARD-4 protein and said interactor; and 

c) identifying a compound that blocks the 
interaction of said CARD-4 protein with said interactor 
when said compound reduces the binding of said CARD-4 

2 0 protein with said interactor; 

wherein said interactor is selected from the group 
consisting of CARD- 3 and hNUDC and wherein said CARD-4 
domain comprises amino acids 1-14 5 of an amino acid 
sequence selected from the group consisting of SEQ ID 
25 NO: 8 and SEQ ID NO: 43. 

13. The method of claim 12, wherein the CARD-4 
protein comprising a CARD-4 domain is selected from the 
group consisting of: 

a) a polypeptide comprising the amino acid 

3 0 sequence of SEQ ID NO: 8; 

b) a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 39; 
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c) a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 41; and 

d) a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 43. 

5 14. The method of claim 12, wherein the CARD-4 

protein and interactor are expressed in a recombinant 
prokaryotic or eukaryotic cell line or wherein the CARD-4 
protein and interactor are isolated proteins or present 
in cell-free protein extracts. 

10 15. A method for identifying a compound that 

inhibits the induction of the NF-/cB pathway by a CARD-4 
protein comprising the steps of : 

a) incubating a recombinant cell line containing 
a vector expressing CARD-4 in the presence and absence of 

15 a test agent; 

b) determining whether said test agent inhibits 
the induction of the NF-kB pathway by CARD-4; and 

c) identifying a compound that inhibits the 
induction of the NF-kB pathway by CARD-4 . 

20 16. The method of claim 15, wherein the CARD-4 

protein comprising a CARD-4 domain is selected from the 

group consisting of: 

a) a polypeptide comprising the amino acid 

sequence of SEQ ID NO: 8; 
25 b) a polypeptide comprising the amino acid 

sequence of SEQ ID NO: 39; 

c) a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 41; and 

d) a polypeptide comprising the amino acid 
30 sequence of SEQ ID NO: 43. 
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17. The method of claim 15 further comprising: 

a) incubating a recombinant cell line expressing 
CARD-4 and also expressing an NF-kB pathway reporter gene 
in the presence and absence of a test agent; 

b) determining whether said test agent inhibits 
the induction of the NF-eB pathway reporter gene by 
CARD - 4 ; and 

c) identifying a compound that inhibits the 
induction of the NF-kB pathway reporter gene by CARD-4 . 

18. A method for identifying a compound that 
inhibits the enhancement of caspase 9 activity by a 
CARD-4 protein comprising a CARD-4 domain comprising the 
steps of : 

a) incubating a recombinant cell line expressing 
caspase 9 and CARD-4 in the presence and absence of a 
test agent ; 

b) determining whether said test agent inhibits 
caspase 9 activity; and 

c) identifying a compound that inhibits the 
enhancement of caspase 9 activity by a CARD-4 protein. 

19. The method of claim 18 further comprising: 

a) incubating a recombinant cell line expressing 
caspase 9 and CARD-4 and a beta-galactosidase expression 
vector in the presence and absence of a test agent ; 

b) determining whether the presence of said test 
agent increases the proportion of cells that stain 
positive for beta-galactosidase; and 

c) identifying a compound that inhibits the 
enhancement of caspase 9 activity by a CARD-4 protein by 
identifying a compound that increases the proportion of 
cells that stain positive for beta-galactosidase. 
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20. The method of claim 18, wherein the CARD-4 
protein comprising a CARD-4 domain is selected from the 
group consisting of: 

a) a polypeptide comprising the amino acid 
5 sequence of SEQ ID NO : 8 ; 

b) a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 39; 

c) a polypeptide comprising the amino acid 
sequence of SEQ ID NO: 41; and 

10 d) a polypeptide comprising the amino acid 

sequence of SEQ ID NO: 43. 

21. A method for identifying a compound that 
inhibits the enhancement of caspase 9 activity by a 
CARD-3 protein comprising a CARD-3 domain comprising the 

15 steps of : 

a) incubating a recombinant cell line expressing 
caspase 9 and CARD-3 in the presence and absence of a 
test agent; 

b) determining whether said test agent inhibits 
20 caspase 9 activity; and 

c) identifying a compound that inhibits the 
enhancement of caspase 9 activity by a CARD-3 protein. 

22. The method of claim 21 further comprising: 

a) incubating a recombinant cell line expressing 
25 caspase 9 and CARD-3 and a beta-galactosidase expression 

vector in the presence and absence of a test agent; 

b) determining whether the presence of said test 
agent increases the proportion of cells that stain 
positive for beta-galactosidase; and 
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c) identifying a compound that inhibits the 
enhancement of caspase 9 activity by a CARD-3 protein by 
identifying a compound that increases the proportion of 
cells that stain positive for beta-galactosidase . 



WO 99/40102 



PCT/US99/02544 



Z CACGCGTCCGGTCAGC . CTGG - . . — -urtunrtvju^uCGGs — » GGGCCA 1 C G GGGGAATGGGC 

3CCCTC3TGACC^AGTGTTGCGGGGCAAAAAGGGTCTTGCCGGCCTCGCTCGTGCAGGGGCGTAT 
ZTGGGCGCCTGAGCGCGGCGTGGGAGwv. - ~ ovjvjAGCCGCCGCAGCAGGGGGCACACCCGGAACCG 
G C CT G AG CG CC CGGGACCAT GAAC GGGG AGG C CATCTGCAGCGCCCTGCCC AC C ATT CCCTACC A 

zaaactcgccgacctgcgctacctgagccgcggcgcctctggcactgtgtcgtcggcccgccacg 
cagactggcgcgtccaggtggcggtgaagcacctgcacatccacactccgctgctcgacagtgaa 
agaaaggatgtcttaagagaagctgaaattttacacaaagctagatttagttacattcttccaat 

7TTGGGAATTTGCAATGAGCCTGAATTTTTGGGAATAGTTACTGAATACATGCCAAATGGATCAT 

taaatgaactcgtacataggaaaactgaatatcctgatgttgcttggccattgagatttggtatc 
ctgcatgaaattgcccttggtgtaaattacctgcacaatatgactcctcctttacttgatcatga 
cttgaagactcagaatatcttattggacaatgaatttcatgttaagattgcagattttggtttat 

C AAAGTG GC G CAT G AT GTCCGTGTG A CAGTCACGAAGTAGCAAATCTGCACC AG AAGGAGGGACA 
ATTATCTATATGCCACCTGAAAACTATGAACCTGGACAAAAATCAAGGGCCAGTATCAAGCACGA 
TATATATAGCTATGCAGTTATCACATGGGAAGTGTTATCCAGAAAACAGCCTTTTGAAGATGTCA 
CCAATCCTTTGCAGATAATGTATAGTGTGTCACAAGGACATCGACCTGTTATTAATGAAGAAAGT 
7TGCCATATGATATACCTCACCGAGCACGTATGATCTCTCTAATAGAAAGTGGATGGGCACAAAA 
TCCAGATGAAAGACCATCTTTCTTAAAATGTTTAATAGAACTTGAACCAGTTTTGAGAACATTTG 
AAGAGATAACTTTTCTTGAAGCTGTTATTCAGCTAAAGAAAACAAAGTTACAGAGTGTTTCAAGT 
G C CATTCAC CTATGTGACAAGAAG AAAAT GGAATT ATCTCTG AACATAC CTGTAAATCAT GGTC C 
ACAAGAGGAATCATGTGGATCCTCTCAGCTCCATGAAAATAGTGGTTCTCCTGAAACTT CAAGGT 
CCCTGCCAGCTCCTCAAGACAATGATTTTTTATCTAGAAAAGCTCAAGACTGTTATTTTATGAAG 
GTGCATCACTGTCCTGGAAATCACAGTTGGGATAGCACCATTTCTGGATCTCAAAGGGCTGCATT 
3TGTGATCACAAGACCATTCCATGCTCTTCAGCAATAATAAATCCACTCTCAACTGCAGGAAACT 
CAGAACGTCTGCAGCCTGGTATAGCCCAGCAGTGGATCCAGAGCAAAAGGGAAGACATTGTGAAC 
CAAATGACAGAAGCCTGCCTTAACCAGTCGCTAGATGCCCTTCTGTCCAGGGACTTGATCATGAA 
AGAGGACTATGAACTTGTTAGTACCAAGCCTACAAGGACCTCAAAAGTCAGACAATTACTAGACA 
CTACTGACATCCAAGGAGAAGAATTTGCCAAAGTTATAGTACAAAAATTGAAAGATAACAAACAA 
ATGGGTCTTCAGCCTTACCCGGAAATACTTGTGGTTTCTAGATCACCATCTTTAAATTTACTTCA 
AAATAAAAGCATGTAAGTGACTGTTTTTCAAGAAGAAATGTGTTTCATAAAAGGATATTTATAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA (SEQ ID NO:l) 
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^TCGCAGCT TGGAAGAGAC AGARCAATTC CAGAAWTAAA 7" 
AGA77TAACC AATGTTGT77 7AAAA7A77C 7AAC77CAAA GAATGATGCC AGAAC7TWAA 
AAGGGRCTGC GCAGAGTAGC ~GGGGCCCZ~ GAGGGCGCGG CCTGAATCCT GATTGCCC7T 
TTGCTGAGAG GACACACGCA GCTGAAGA7G AAT7TGGGAA AAGTAGCCGC TTGCTACTTT 
AAC7A7GGAA GAGCAGGGCC ACAGTGAGAT GGAAATAATC C CAT CAGAGT CTCACCCGGA 
CATTCAATTA C7GAAAAGCA ATCGGGAACT 7C7GGTCACT CACATCCGCA ATACTCAGTG 
TCTGGTGGAC AACTTGCTGA AGAATGACTA CTTCTCGGCC GAAGATGCGG AGATTGTGTG 
7GCC7GCCCC ACCCAGCCTG ACAAGGTCCG CAAAATTCTG GACCTGGTAC AGAGCAAGGG 
CGAGGAGGTG 7CCGAGTTCT TCCTCTACTT GCTCCAGCAA CTCGCAGATG C CTACG7GG A 
CCTCAGGCC7 7GGCTGCTGG AGATCGGCTT CTCCCCTTCC CTGCTCACTC AGAGCAAAGT 
CGTGG7CAAC ACTGACCCAG 7 GAG CAGGT A TACCCAGCAG CTGCGACACC ATCTGGGCCG 
7GAC7C7AAG 77CGTGCTGT GCTATGCCGA GAAGGAGGAG CTGCTGCTGG AGGAGA7C7A 
CATGGACACC ATCATGGAGC 7GGTTGGCTT CAGCAATGAG AGCCTGGGCA GCCTGAACAG 
CCTGGCCTGC CTCCTGGACC ACACCACCGG CATCCTCAAT GAGCAGGGTG AGACCATCT7 
CATCC7GGGT GATGCTGGGG 7GGGCAAGTC CATGCTGCTA CAGCGGCTGC AGAGCCTCTG 
GGCCACGGGC CGGCTAGACG CAGGGGTCAA ATTCTTCTTC CACTTTCGCT GCCGCATGTT 
CAGCTGCTTC AAGGAAAGTG ACAGGCTGTG 7CTGCAGGAC CTGCTCTTCA AGCACTACTG 
CTACCGAGAG ZZGGXCCCCZ AGGAGGTGTT TGCCTTCCTG CTGCGCTTCC CCCACGTGGC 
CCTC77CACC TTCGATGGCC 7GGACGAGCT G CACTCGGAC TTGGACCTGA GCCGCGTGCC 
7GACAGCTCC 7GCCCCTGGG AGCC7GCCCA CCCCCTGGTC TTGCTGGCCA ACCTGCTCAG 
TGGGAAGCTG C7CAAGGGGG CTAGCAAGCT GCTCACAGCC CGCACAGGCA 7GGAGGTCCC 
GCGCCAGTTC CTGCGGAAGA AGGTGCTTCT CCGGGGCTTC TCCCCCAGCC ACCTGCGCGC 
CTATGCCAGG AGGATGTTCC CCGAGCGGgC CCTGCAGGAC CGCCTGCTGA GCCAGCTGGA 
GGCCAACCCC AACCTCTGCA GCCTGTGCTC TGTGCCCCTC TTCTGCTGGA TCATCTTCCG 
GTGCTTCCAG CACTTCCGTG CTGCCTTTGA AGGCTCACCA CAGCTGCCCG ACTGCACGAT 
GACC77GACA GATGTCTTCC 7CCTGGTCAC TGAGGTCCAT CTGAACAGGA TGCAGCCCAG 
CAGCC7GGTG CAGCGGAACA CACGCAGCCC AGTGGAGACC CTCCACGCCG GCCGGGACAC 
7C7GTGCTCG C7GGGGCAGG TGGCCCACCG GGGCATGGAG AAGAGCCTCT TTGTCTTCAC 
CCAGGAGGAG G7GCAGGCCT CCGGGCTGCA GGAGAGAGAC ATGCAGCTGG GCTTCCTGCG 
GGCTTTGCCG GAGCTGGGCC GCGGGGGTGA CCAGCAGTCC TATGAGTTTT 7CCACC7CAC 
CC7CGAGGCC 77CTTTACAG CCTTCTTCGT CGTGCTGGAC GACAGGGTGG GCACTCAGGA 
GC7GC7CAGG 77CTTCCAGG AGTGGATGCC CCCTGCGGGG GCAGCGACCA CGTCCTGCTA 
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. CCTCCCTTC CTCCCGT\L"CC AGXGCCTGCA 
Z AAG AAC AAG GATCACTTCC AGTTCACCAA 
CAAACAGAAA CTCCTGCOGC ATCTGGTGCC 
CACCTGTTTT CCAGCCTGCG 
GGTCGAAAGC 7TCAACCAGG TG C AGGC CAT 
Z 7 AC GAGACA CAGAGCCAGA AGGTGGGGCA 
CCTCAAGCTG ACCTACTGCA ACGCCTGCTC 
3CATCACTTG CCCAAGCGGC TGGCCCTAGA 
CGTGCGGGAG CZTGCAGCCCT G CT T C AG C C G 
GATCACTGAC GGTGGGGTAA AGGTGCTAAG 
CTATTTG GGT TTATACAACA AC CAGATCAC 
ZCTGGATGAA TGCAAAGGCC TCACGCAT'CI* 
AGGAGGGAAG TATCTCGCCC TGGCTGTGAA 
GTGGGGCAAT CAAGTTGGGG ATGAAGGAGC 
CCCCAGCTTG ACCACCCTGA GTCTTGCGTC 
CCTTGCGAGG GCCCTGCAGC AGAACACGTC 
ACTCAACGAT GAAGTGGCAG AGAGTTTGGC 
GCATTTATGG CTTATCCAGA ATCAGATCAC 
GTTACAGAGC AACACTGGCA TAACAGAGAT 
GGAGGCCAAA GTCTATGAAG ATGAGAAGCG 
3TTCATGGGG T* i I"i XGCCCT GGAGCCTCAG 
GTCAGTGTCT TAAAGGGGCC TGCGCAGGCG 
GCAAGCCAGC TTCCTGTGCA GAAGGTCTGG 
TCTGCAGAAA AAGAATGTGT CTTGCGAGCT 
GAAAAAAAAA ACGGACGCGT GG (SEQ II 



GGGCAGTGGT 


CCGGCGCGGG 


AAGAC CT" CTT 


CCTCTTCCTG 


TGCGGGCTGT 


TGTCCAAAGC 


CGCGGCAGCC 


CTGAGGAGAA 


AGCGCAAGGC 


GGGCTACCTG 


AAGAGCCTGC 


CCCGCGTTCA 


GCCCACG xTC 




TGCGCTGCAT 


G CTGGCGGCC 


AGGGGCATCT 


GCGCCAACTA 


GGCCGACTGC 


AGCGCCCTCT 


CCTTCGTCCT 


C CTAGACAAC 


AACAATCTCA 


ACGACTACGG 


CCTCACTGTT 


CTCAGACTCA 


GCGTAAACCA 


CG AAGAGCXG 


ACCAAATACA 


AAATTGTGAC 




GCCAGGTACG 


TCACCAAAAT 


TAAACTGGGA 


AAAAACAAAA 


TAACAAGTGA 


GAACAGCAAA 


TCAATCTCTG 


AGGTTGGGAT 


AAAAGCCTTC 


GCAGAGGCTC 


TGCGGAACCA 


CAACGGCATC 


TCCACAGAAG 


GAGGAAAGAG 


TCTAGAAATA 


CTGTGGCTGA 


CCCAAAATGA 


AGAAATGTTG 


AAAGTCAACC 


AGACGTTAAA 


AGCTAAGGGG 


ACTGCCCAGC 


TGGCAGATGC 
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IACGCGTCCGACTTGCTGAAGAATGACTACTTCTCGGCCGAAGATGCGGAGATTGTGT 

3TGCCTGCCCCACCCAGCCTGACAAGGTCCGCAAAATTCTGGACCTGGTACAGAGCAAG 

3GCGAGGAGGTGTCCGAGTTCTTCCTCTACTTGCTCCAGCAACTCGCAGATGCCTACGT 

GGACCTCAGGCCTTGGCTGCTGGAGATCGGCTTCTCCCCTTCCCTGCTCACTCAGAGCA 

.\AGTCGTGGTCAACACTGACCCAGTGAGCAGGTATACCCAGCAGCTGCGACACCATCTG 

3GCCGTGACTCCAAGTTCG7GCTGTGCTATGCCCAGAAGGAGGAGC7GCTGCTGGAGGA 

3ATCTACATGGACACCATCATGGAGCTGGTTGGCTTCAGCAATGAGAGCCTGGGCAGCC 

TGAACAGCCTGGCCTGCCTCCTGGACCACACCACCGGCATCCTCAATGAGCAGGGTGAG 

ACCATCTTCATCCTGGGTGATGCTGGGGTGGGCAAGTCCATGCTGCTACAGCGGCTGCA 

GAGCCTCTGGGCCACGGGCCGGCTAGACGCAGGGGTCAAATTCTTCTTCCACTTTCGCT 

GCCGCATGTTCAGCTGCTTCAAGGAAAGTGACAGGCTGTGTCTGCAGGACCTGCTCTTC 

AAGCACTACTGCTACCC^GAGCGGGACCCCGAGGAGGTGTTTGCCTTCCTGCTGCGCTT 

CCCCCACGTGGCCCTCTTCACCTTCGATGGCCTGGACGAGCTGCACTCGGACTTGGACC 

TGAGCCGCGTGCCTGACAGCTCCTGCCCCTGGGAGCCTGCCCACCCCCTGGTCTTGCTG 

GCCAACCTGCTCAGTGGGAAGCTGCTCAAGGGGGCTAGCAAGCTGCTCACAGCCCGCAC 

AGGCATCGAGGTCCCGCGCCAGTTCCTGCGGAAGAAGGTGCTTCTCCGGGGCTTCTCCC 

CCAGCCACCTGCGCGCCTATGCCAGGAGGATGTTCCCCGAGCGGGCCCTGCAGGACCGC 

rTGCTGAGCCAGCTGGAGGCCAACCCCAACCTCTGCAGCCTGTGCTCTGTGCCCCTCTT 

CTGCTGGATCATCTTCCGGTGCTTCCAGCACTTCCGTGCTGCCTTTGAAGGCTCACCAC 

AGCTGCCCGACTGCACGATGACCCTGACAGATGTCTTCCTCCTGGTCACTGAGGTCCAT 

CTGAACAGGATGCAGCCCAGCAGCCTGGTGCAGCGGAACACACGCAGCCCAGTGGAGAC 

CCTCCACGCGGGCCGGGACACTCTGTGCTCGCTGGGGCAGGTGGCCCACCGGGGCATGG 

AGAAGAGCCTCTTTGTCTTCACCCAGGAGGAGGTGCAGGCCTCCGGGCTGCAGGAGAGA 

GACATGCAGCTGGGCTTCCTGCGGGCTTTGCCGGAGCTGGGCCCCGGGGGTGACCAGCA 

GTCCTATGAGTTTTTCCACCTCAGCCTCCTCACCTGTAAAACTGGGATCCCAGTATAGA 

CTTTGGAAATCAGTAGACACCATATGCTTCAAAAAACAGGGGCTATTAAAATGACATCA 

GGAGCCAGAAAGTCTCATGGCTGTGCTTTCTCTTGAAGTTTATACAACAACCAGATCAC 

CGATGTCGGAGCCAGACTGGGAAAAAACAAAATAACAAGTGAAGGAGGGAAGTATCTCG 

CCCTGGCTGTGAAGAACAGCAAATCAATCTCTGAGGTTGGGATGTGGGGCAATCAAGTT 

GGGGATGAAGGAGCAAAAGCCTTCGCAGAGGCTCTGCGGAACCACCCCAGCTTGACCAC 

CCTGAGTCTTGCGTCCAACGGCATCTCCACAGAAGGAGGAAAGAGCCTTGCGAGGGCCC 

TGCAGCAGAACACGTCTCTAGAAATACTGTGGCTGACCCAAAATGAACTCAACGATGAA 

3TGGCAGAGAGTTTGGCAGAAATGTTGAAAGTCAACCAGACGTTAAAGCATTTATGGCT 

TATCCAGAATCAGATCACAGTCTTTTGTGTCAGTGTCTTAAAGGGGCCTGCGCAGGCGG 

GACTATCAGGAGTCCACTGCCTCCATGATGCAAGCCAGCTTCCTGTGCAGAAGGTCTGG 

TCGGCAAACTCCCTAAGTACCCGCTACAATTCTGCAGAAAAAGAATGTGTCTTGCGAGC 

TGTTGTAGTTACAGTAAATACACTGTGAAGAGACTTTATTGCCTATTATAATTATTTTT 

ATCTGAAGCTAGAGGAATAAAGCTGTGAGCAAACAGAGGAGGCCAGCCTCACCTCATTC 

CAACACCTGCCATAGGGACCAACGGGAGCGAGTTGGTCACCGCTCTTTTCATTGAAGAG 

TTGAGGATGTGGCACAAAGTTGGTGCCAAGCTTCTTGAATAAAACGTGTTTGATGGATT 

AGTATTATACCTGAAATATTTTCTTCCTTCTCAGCACTTTCCCATGTATTGATACTGGT 

CCCACTTCACAGCTGGAGACACCGGAGTATGTGCAGTGTGGGATTTGACTCCTCCAAGG 

TTTTGTGGAAAGTTAATGTCAAGGAAAGGATGCACCACGGGCTTTTAATTTTAATCCTG 

GAGTCTCACTGTCTGCTGGCAAAGATAGAGAATGCCCTCAGCTCTTAGCTGGTCTAAGA 

ATGACGATGCCTTCAAAATGCTGCTTCCACTCAGGGCTTCTCCTCTGCTAGGCTACCCT 

CCTCTAGAAGGCTGAGTACCATGGGCTACAGTGTCTGGCCTTGGGAAGAAGTGATTCTG 

TCCCTCCAAAGAAATAGGGCATGGCTTGCCCCTGTGGCCCTGGCATCCAAATGGCTGCT 

TTTGTCTCCCTTACCTCGTGAAGAGGGGAAGTCTCTTCCTGCCTCCCAAGCAGCTGAAG 

3GTGACTAAACGGGCGCCAAGACTCAGGGGATCGGCTGGGAACTGGGCCAGCAGAGCAT 

GTTGGACACCCCCCACCATGGTGGGCTTGTGGTGGCTGCTCCATGAGGGTGGGGGTGAT 

ACTACTAGATCACTTGTCCTCTTGCCAGCTCATTTGTTAATAAAATACTGAAAACACAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAA (SEQ ID NO: 25) 
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CCCGCGTCCGCGTCCCCGGACCATGGCGCTCTCCGGGCTCTTCTCTAGC7CTCAGCGGCT 
GCGAAGTCTGTNAACCTGGTGGCCAAGTGATTGTAAGTCAGGAGACTTTCCTTGGGTTTC 
TGCCTTTGATGGCAAGAGGTGGAGATTGTGGCGGCGATTACAGAAAACATGTGGGAAGAG 
AAGTTGCTGTTTTTATGGGAATCGGAGGCTTGGAAGAGACAGAAGCAATTGCAGAAATAA 
ATTGGAAATTGAAGATTTAAACAATGTTGTTTTAAAATATTCTAACTTCAAAGAATGATG 
CGAGAAACTTAAAAAGGGGCTGCGCAGAGTAGCAGGGGGCCTGGAGGGCGCGGCGTGAAT 
CCTGATTGCCCTTCTGCTGAGAGGACACACGCAGCTGAAGATGAATTTGGGAAAAGTAGG 
CGGTTGCTACTTTAACTATGGAAGAGCAGGGCCACAGTGAGATGGAAATAATCCCATCAG 
AGTCTCACCCCCACATTCAATTACTGAAAAGCAATCGGGAACTTCTGGTCACTCACATCG 
GCAATACTCAGTGTCTGGTGGACAACTTGCTGAAGAATGACTACTTCTCGGCCGAAGATG 
CGGAGATTGTGTGTGCCTGCCCGACCCAGCCTGACAAGGTCCGCAAAATTCTGGACCTGG 
TACAGAGCAAGGGCGAGGAGGTGTGCGAGTTCTTCGTCTACTTGCTCCAGCAACTCGCAG 
ATGCCTACGTGGACCTCAGGCCTTGGCTGCTGGAGATCGGCTTCTCCCCTTCCCTGCTGA 
CTCAGAGCAAAGTCGTGGTCAACACTGACCCAGTGAGCAGGTATACCCAGCAGCTGCGAC 
ACCATCTGGGCCGTGACTCCAAGTTCGTGCTGTGCTATGCCCAGAAGGAGGAGCTGCTGC 
TGGAGGAGATCTACATGGACACCATCATGGAGCTGGTTGGCTTCAGCAATGAGAGCCTGG 
GCAGCCTGAACAGGCTGGCCTGCCTGCTGGACCACACCACCGGCATCCTCAATGAGCAGG 
CTGCTTCAAGGAAAGTGACAGGCTGTGTCTGCAGGACCTGCTCTTCAAGCACTACTGCTA 
CCCAGAGCGGGACCCCGAGGAGGTGTTTGCCTTCCTGCTGCGCTTCCCCCACGTGGCCCT 
GTTCACCTTCGATGGCCTGGACGAGCTGCAGTCGGACTTGGACCTGAGCCGCGTGCCTGA 
CAGCTCCTGCCCCTGGGAGCCTGCCCACCCCCTGGTCTTGCTGGCCAACCTGCTCAGTGG 
GAAGCTGCTCAAGGGGGCTAGCAAGCTGCTCACAGCCCGCACAGGCATCGAGGTCCCGCG 
CCAGTTCCTGCGGAAGAAGGTGCTTCTCCGGGGCTTCTCCCCCAGCCACCTGCGCGCCTA 
TGCCAGGAGGATGTTCCCCGAGCGGGCCCTGCAGGACCGCCTGCTGAGCCAGCTGGAGGC 
CAACCCCAACCTCTGCAGCCTGTGCTCTGTGCCCCTCTTCTGCTGGATCATCTTCCGGTG 
CTTCCAGCACTTCCGTGCTGCCTTTGAAGGGTCACCACAGCTGCCCGACTGGACGATGAC 
CCTGACAGATGTCTTCCTCCTGGTCACTGAGGTCCATCTGAACAGGATGCAGCCGAGCAG 
CCTGGTGCAGCGGAACACAGGCAGCCCAGTGGAGACCCTCCACGCCGGCCGGGACAGTCT 
GTGCTCGCTGGGGCAGGTGGCCCACCGGGGCATGGAGAAGAGCCTCTTTGTCTTCACCCA 
GGAGGAGGTGCAGGCCTCCGGGCTGCAGGAGAGAGACATGCAGCTGGGCTTCCTGCGGGG 
TTTGCCGGAGCTGGGCCCCGGGGGTGACCAGCAGTCCTATGAGTTTTTCCACCTCACCCT 
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CCAGGCCTTCTTTACAGCCTTCTTCCTCGTGCTGGACGACAGGGTGGGCACTCAGGAGCT 
GCTCAGGTTCTTCCAGGAGTGGATGCCCCCTGCGGGGGCAGCGACCACGTCCTGCTATCC 
TCCCTTCCTCCCGTTCCAGTGCCTGCAGGGCAGTGGTCCGGCGCGGGAAGACCTCTTCAA 
GAACAAGGATCACTTCCAGTTCACCAACCTCTTCCTGTGCGGGCTGTTGKCCAAAGCCAA 
ACAGAAACTCCTGCGGCATCTGGTGCCCGCGGCAGCCCTGAGGAGAAAGCGCAAGGCCCT 
GTGGGCACACCTGTTTTCCAGCCTGCGGGGCTACCTGAAGAGCCTGCCCCGCGTTCAGGT 
CGAAAGCTTCAACCAGGTGCAGGCCATGCCCACGTTCATCTGGATGCTGCGCTGCATCTA 
CGAGACACAGAGCCAGAAGGTGGGGCAGCTGGCGGCCAGGGGCATCTGCGCCAACTACCT 
CAAGCTGACCTACTGCAACGCCTGCTCGGCCGACTGCAGCGCCCTCTCCTTCGTCCTGCA 
TCACTTCCCCAAGCGGCTGGCCCTAGACGTAGACAACAACAATCTCAACGACTACGGCGT 
GCGGGAGCTGCAGCCCTGCTTCAGCCGCCTCACTGTTCTCAGACTCAGCGTAAACCAGAT 
CACTGACGGTGGGGTAAAGGTGCTAAGCGAAGAGCTGACCAAATACAAAATTGTGACCTA 
TTTGGGTTTATACAACAACCAGATCACCGATGTCGGAGCCAGGTACGTCACCAAAATCCT 
GGATGAATGCAAAGGCCTCACGCATCTTAAACTGGGAAAAAACAAAATAACAAGTGAAGG 
AGGGAAGTATCTCGCCCTGGCTGTGAAGAACAGCAAATCAATCTCTGAGGTTGGGATGTG 
GGGCAATCAAGTTGGGGATGAAGGAGCAAAAGCCTTCGCAGAGGCTCTGCGGAACCACCC 
CAGCTTGACCACCCTGAGTCTTGCGTCCAACGGCATCTCCACAGAAGGAGGAAAGAGCCT 
TGCGAGGGCCCTGCAGCAGAACACGTCTCTAGAAATACTGTGGCTGACCCAAAATGAACT 
CAACGATGAAGTGGCAGAGAGTTTGGCAGAAATGTTGAAAGTCAACCAGACGTTAAAGCA 
TTTATGGCTTATCCAGAATCASATCACAGCTWARGGGACTGCCCAGCTGGCAGATGCGTT 
ACAGAGCAACACTGGCATAACAGAGATTTGCCTAAATGGAAACCTGATAAAACCAGAGGA 
GGCCAAAGTCTATGAAGATGAGAAGCGGATTATCTGTTTCTGAGAGGATGCTTTCCTGTT 
CATGGGGTTTTTGCCCTGGAGCCTCAGCAGCAAATGCCACTYTGGGCAGTCTTTTGTGTC 
AGTGTCTTAAAGGGGCCTGCGCAGGCGGGACTATCAGGAGTCCACTGCCTCCATGATGCA 
AGCCAGCTTCCTGTGCAGAAGGTCTGGTCGGCAAACTCCCTAAGTACCCGCTACAATTCT 
GCAGAAAAAGAATGTGTCTTGCGAGCTGTTGTAGTTACAGTAAATACACTGTGAAGAGAC 
TTTATTGCCTATTATAATTATTTTTATCTGAAGCTAGAGGAATAAAGGTGTGAGCAAACA 
GAGGAGGCCAGCCTCACCTCATTCCAACACCTGCCATAGGGACCAACGGGAGCGAGTTGG 
TCACCGCTCTTTTCATTGAAGAGTTGAGGATGTGGCACAAAGTTGGTGCCAAGCTTCTTG 
AATAAAACGTGTTTGATGGATTAGTATTATACCTGAAATATTTTCTTCCTTCTCAGCACT 
TTCCCATGTATTGATACTGGTCCCACTTCACAGCTGGAGACACCGGAGTATGTGCAGTGT 
GGGATTTGACTCCTCCAAGGTTTTGTGGAAAGTTAATGTCAAGGAAAGGATGCACCACGG 

FIG . 10 (Page 2 of 3) 



12 / 35 



WO 99/40102 



PCT/US99/02544 



GCTTTTAATTTTAATCCTGGAGTCTCACTGTCTGCTGGCAAAGATAGAGAATGCCCTCAG 
CTCTTAGCTGGTCTAAGAATGACGATGCCTTCAAAATGCTGCTTCCACTCAGGGCTTCTC 
CTCTGCTAGGCTACCCTCCTCTAGAAGGCTGAGTACCATGGGCTACAGTGTCTGGCCTTG 
3GAAGAAGTGATTCTGTCCCTCCAAAGAAATAGGGCATGGCTTGCCCCTGTGGCCCTGGC 
ATCCAAATGGCTGCTTTTGTCTCCCTTACCTCGTGAAGAGGGGAAGTCTCTTCCTGCCTC 
CCAAGCAGCTGAAGGGTGACTAAACGGGCGCCAAGACTCAGGGGATCGGCTGGGAACTGG 
GCCAGCAGAGCATGTTGGACACCCCCCACCATGGTGGGCTTGTGGTGGCTGCTCCATGAG 
GGTGGGGGTGATACTACTAGATCACTTGTCCTCTTGCCAGCTCATTTGTTAATAAAATAC 
TGAAAACCCAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGCGG (SEQ ID NO : 38 ) 
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MEEQGHSEMEIIPSESKPHIQLLKSNRELLVTHIRNTQCLVDNLLKNDYFSAEDAEIVCA 

CPTQPDKVRKILDLVQSKGEEVSEFFLYLLQQLADAYVDLRPWLLEIGFSPSLLTQSKVV 

^/NTDPVSRYTQQLRHHLGRDSKFVLCYAQKEELLLEEIYMDTIMELVGFSNESLGSLNSL 

ACLLDHTTGILNEQAASRKVTGCVCRTCSSSTTATQSGTPRRCLPSCCASPTWPSSPSMA 
WTSCTRTWT (SEQ ID NO: 39) 
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CACGCGTCCGCGCTACTGCGGGAGCAGCGTCCTCCCGGGCCACGGCGCTTCCCGGCCCCG 
GCGTCCCCGGACCATGGCGCTC7CCGGGCTCTTC7CTAGCTCTCAGCGGCTGCGAAG7CT 
GTAAACCTGGTGGCCAAGTGATTGTAAGTCAGGAGACTTTCCTTCGGTTTCTGCCTTTGA 
TGGCAAGAGGTGGAGATTGTGGCGGCGATTACAGAAAACATCTGGGAAGACAAGTTGCTG 
TTTTTATGGGAATCGCAGGCTTGGAAGAGACAGAAGCAATTCCAGAAATAAATTGGAAAT 
TGAAG ATTTAAACAATGTTGTTTTAAAATATT CTAACTTCAAAGAATGATG C CAGAAACT 

TAAAAAGGGGCTGCGCAGAGTAGCAGGGGCCCTGGAGGGCGCGGCCTGAATCCTGATTGC 

CCTTCTGCTGAGAGGACACACGCAGCTGAAGATGAATTTGGGAAAAGTAGCCGCTTGCTA 

CTTTAACTATGGAAGAGCAGGGCCACAGTGAGATGGAAATAATCCCATCAGAGTCTCACC 

CCCACATTCAATTACTGAAAAGCAATCGGGAACTTCTGGTCACTCACATCCGCAATACTC 

AGTGTCTGGTGGACAACTTGCTGAAGAATGACTACTTCTCGGCCGAAGATGCGGAGATTG 

TGTGTGCCTGCCCCACCCAGCCTGACAAGGTCCGCAAAATTCTGGACCTGGTACAGAGCA 

AGGGCGAGGAGGTGTCCGAGTTCTTCCTCTACTTGCTCCAGCAACTCGCAGATGCCTACG 

TGGACCTCAGGCCTTGGCTGCTGGAGATCGGCTTCTCCCCTTCCCTGCTCACTCAGAGCA 

AAGTCGTGGTCAACACTGACCCAGGTAGGAGTCAGCCCCAGCAAGACCGCAGGCACCAGT 

GCAAGCAGGGCCCTGGGGGGTTTGGTAATGGCTGGGCCAGCCCTGAGTGCCACCTCAGGA 

AGCAGGCCCAGGTGCTATTTTGATTTTAGAAAGGAACAGCTGAATCCTGTCTCCCAAGTG 

CAGCCCAGGTGGCTGCGATTGAACTGCCCACACCTCGATGGTCTGGTTTATAGAGGGGCC 

TTTGGAAGTATGGGAATGGCCTGTGTTCTGACCCCTTGCTTTCTTCCTATTCTGACATAT 

GTAGACATTTTAATGGTTGCACAAATTCAAGGTTGTATTTTTTTTTCTTTAAAAAAATCT 

TTAGCTGGACATGGTAGCACACACCTGTAGTTCCAGCTACTCAGGAGGCTGAGGCAAGAG 

GACTGCTTGAGCCCCAGAGTCTAAGGCTGCAGCGAGCTATGATTGTGCCCCTACACTCCA 

CAGCCTGGGTTTTAGAGTGAGACCCTGTCTCTAAAAAAAA 

AAAAAAAAAAAAANG GGCGG (SEQ ID NO:4o) 
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MEEQGHSEMEI I ?SESKPHIQLLKSNRELLVTHIRNTQCL>VDNLLKNDYFSAEDAEIVCA 
CPTQPDKVRKILDLVQSKGEEVSEFFLYLLQQLADAYVDLRPWLLEIGFSPSLLTQSKVV 
VNTDPGRSQFQQDRRHQCKQGPGGFGNGWASPECHLRKQAQVLF 
(SEQ ID NO : 4 i ) 
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CCACGCGTCCGCGGACCGCGAGCGGTAGCGCCCTCCCTCCCAGCTGTTGTCCCGCCCGAT 
CCGCGACCCTAGTCCCCGGATCCCCTTGCTGAGAGTCACCGTACTCCAGGGCCAACTGAG 
CCAAAGTCCTGCCAACTTGGGTCAGCAATGAAAGGCAGGATCCTGGGTGGTGGCCCTGAA 
TCCTGATTTGTCTGCCCTGCCAGCGAGACACATGTGGTCAAAGATGAATTTGAGAAAAGT 
AGCTGCTGGCTACTTGAACAATGGAGGAACACGGCCATCATGAGATGGAAGGCACCCCAT 
TGGGTTGTCACTCCCACATTAAACTGCTGAAGATCAACAGGGAACATCTGGTCACCAACA 
TTCGGAACACTCAGTGTCTGGTGGACAACTTGCTGGAGAATGGCTACTTCTCAGCCGAAG 
ATGCAGAGATTGTGTGTGCCTGTCCCACCAAGCCTGACAAGGTCCGAAAGATCCTTGACC 
TGGTGCAGAGCAAAGGCGAGGAGGTGTCTGAGTTCTTCCTCTACGTGCTGCAGCAGCTGG 
AGGATGCTTACGTGGACCTCAGGCTGTGGCTCTCAGAAATTGGCTTCTCCCCTTCCCAGC 
TCATTCGGACCAAAACTATCGTCAATACTGACCCAGTAAGCAGGTATACCCAACAGCTGC 
GACACCAACTGGGCCGCGACTCCAAGTTCATGCTGTGCTACGCCCAGAAGGAGGACCTGC 
TGCTGGAGGAGACCTATATGGACACACTCATGGGGCTGGTAGGCTTCAACAATGAAAACC 
TGGGCAGCCTAGGAGGCCTGGATTGCCTGCTGGACCACAGTACGGGCGTCCTCAACGAGC 
ATGGCGAGACTGTCTTCGTGTTCGGGGACGCGGGAGTGGGCAAGTCCATGCTGCTGCAGA 
GGTTGCAGAGCCTCTGGGCGTCAGGCAGGTTGACCTCCACAGCCAAATTCTTCTTCCACT 
TCCGCTGCCGCATGTTCAGCTGCTTCAAGGAGAGCGACATGCTGAGTCTGCAGGACCTGC 
TCTTCAAGCATTTCTGCTACCCGGAGCAGGACCCCGAGGAGGTGTTCTCCTTCTTGCTGC 
GCTTTCCCCACACAGCGCTCTTCACTTTTGACGGCCTGGATGAGCTGCACTCAGACTTCG 
ACCTGAGCCGCGTGCCGGATAGCTGCTGCCCCTGGGAGCCGGCTCACCCTCTGGTCCTGC 
TGGCTAACCTCCTAAGTGGGAGGCTGCTCAAGGGTGCCGGCAAATTGCTCACTGCTCGCA 
CAGGCGTGGAGGTCCCCCGCCAGCTCCTGCGCAAAAAGGTGCTGCTCCGGGGCTTCTCCC 
CAAGTCACCTGCGCGCCTATGCCCGCCGGATGTTCCCCGAGCGCACAGCGCAGGAGCATC 
TGCTGCAGCAGCTGGATGCCAACCCCAACCTCTGCAGCCTGTGCGGGGTGCCGCTCTTCT 
GTTGGATCATCTTCCGTTGTTTCCAGCACTTCCAGACGGTCTTCGAGGGCTCCTCTTCAC 
AGTTGCCGGACTGTGCTGTGACCCTGACCGATGTCTTTCTGCTGGTCACTGAGGTGCATC 
TGAACAGGCCGCAGCCCAGCAGCCTGGTGCAGCGCAACACGCGCAGCCCGGCGGAAACCC 
TACGTGCAGGCTGGCGCACGCTGCATGCGCTGGGAGAGGTGGCTCACCGAGGCACCGACA 
AGAGCCTCTTTGTGTTTGGCCAGGAGGAGGTGCAGGCGTCGAAGCTGCAGGAAGGAGATC 
TGCAGCTGGGCTTCCTGCGGGCTTTGCCCGATGTGGGCCCTGAGCAGGGCCAGTCTTACG 
AATTTTTCCACCTTACGCTCCAGGCCTTCTTCACCGCCTTCTTCCTGGTAGCAGATGACA 
AAGTGAGCACCCGGGAGTTGCTGAGGTTCTTTCGAGAATGGACGTCTCCTGGAGAGGCAA 
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CAAGCTCGTCCTGCCATTCTTCCTTCTTCTCCTTCCAGTGCCTGGGCGGCAGAAGCCGGT 
TGGGCCCTGATCCTTTCAGGAACAAAGATCACTTCCAGTTCACCAACCTCTTCGTGTGCG 
GGCTACTGGCCAAAGCCCGACAGAAACTCCTTCGGCAGCTGGTGCCCAAGGCTATCCTGA 
GGAGGAAGCGCAAGGCCCTGTGGGCTCACCTGTTTGCTAGCCTGCGCTCCTACTTGAAGA 
GCCTACCTCGGGTCCAGTCTGGAGGCTTTAACCAGGTGCATGCCATGCCCACATTCCTGT 
GGATGCTGCGCTGCATCTATGAGACGCAGAGCCAGAAGGTGGGGCGCCTCGCCGCCAGGG 
GCATCAGTGCGGACTACCTCAAGCTGGCCTTTTGCAACGCTTGCTCTGCGGACTGCAGCG 
CCCTGTCCTTCGTCCTGCATCACTTCCACAGGCAGCTGGCCCTAGACCTGGACAACAACA 
ACCTCAATGACTATGGCGTGCAGGAGCTGCAGCCTTGCTTTAGCCGTCTCACGGTTATCA 
GACTCAGCGTCAACCAGATCACCGACACGGGGGTGAAGGTGCTATGTGAGGAACTGACCA 
AGTATAAGATCGTGACGTTCCTGGGTTTATACAACAACCAGATAACTGATATCGGAGCCA 
GGTATGTGGCCCAAATCCTGGATGAATGCAGAGGCCTCAAGCACCTTAAACTAGGGAAAA 
ACAGAATAACAAGTGAGGGCGGGAAGTGTGTGGCTTTGGCTGTGAAGAACAGCACCTCCA 
TCGTTGATGTTGGGATGTGGGGTAATCAGATTGGAGACGAAGGGGCAAAGGCCTTCGCAG 
AGGCATTGAAGGACCACCCCAGCCTGACCACTCTCAGTCTTGCATTCAATGGCATCTCTC 
CGGAGGGAGGGAAGAGCCTTGCGCAGGCCCTGAAGCAGAACACCACACTGACAGTAATCT 
GGCTGACCAAAAATGAACTTAATGATGAGTCTGCAGAGTGCTTCGCTGAGATGCTGAGAG 
TGAACCAGACGCTACGGCATTTATGGCTGATCCAGAATCGCATCACAGCCAAGGGGACAG 
CGCAGCTGGCGAGGGCACTGCAGAAGAACACAGCCATAACAGAGATTTGTCTCAATGGAA 
ACTTGATTAAGCCCGAGGAGGCCAAAGTCTTCGAGAATGAGAAGAGAATCATCTGCTTCT 
GACGGACGCTCCTGGGCAGGATCTTTGTCCTAGGTTGCTCCTCAGTCACAGACAGCACTG 
TGCAGTCAGCAGGGTAGCAGGATGCTGTGCAGCGCCTGCAGCAAGGTGCCTGTCAGGAGC 
CCACACCTCCACAGTGCACACCGATGTCCCCTGCTCATGCTTGGACTGGTAGCACCCGCG 
CCGCGGCTGAGACCCTGCAGACGCAGGGAGTCTTAGGAACCATCGTCACCACTCAAAGCC 
AGCAGGGCATCTTCTGTACAAAGATCTCCCTGCATATCCACTAGACGGAAGCTGAAGGAA 
CGCAACAGCAGAGGAGGCCAACAGACGCCTGGCTGAAGGCTCCGTGGGACCAACGGTGTC 
ACCTTCAGAAAAGAGCTGGGAACTTGAGCAGAGCCGATGGTAACTTCTTGGGGAAAGAAG 
GCACCCAGTGACTGCATGGTTATTCTGAGTCCTCCTTCCTCTGCTTAGTCCCTCTCACTG 
TACAGGTCTGTTTCTTCCTCGCAGCTGTGGCTGCTGAAGTAGGTCCACTGTGGGGAGAGC 
TCATCACAGACTTTGGTTCGGTTCTGGATTCTCAGTGGTGGCAACCGAGAGTCAGACGAT 
ACCCTCTAGGTCAGTCTCAGAGGATCTCTATGCTGTGAGAGGGTTGAGGGCCCACCCAGA 
ATTTTTTTTTTTTACCAGTTTTTACTGTGCCTGCCCCAGGAGGGAGAATTACTTCCCAGC 
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CTCCACAGCAGCAGGCATGGCTTGCCTCAATGGTCCTGAGATCCCAACAAAACTCTCTCC 

CTTGCCTGTGAGCAGAAAGTATCTTCATGTCCTCAGAAGTTGGAGGGTGACTGGACACAG 

TTAAGACT CAG AG AG C CAG CT G AT AG CTCAAAGCAAAGCATGGCACATACCC AC C AC CAT 

ACCATGGTGCGCATGGGATGGGACAGTTGGAATGTTGCAGATAACGTGTTCTTTTGCCAG 

TTCATTTGTTAATAAAATATTTAAAACGTTAAAAAAAAAAAAAAAAAA 

G (SEQ ID NO: 42 ) 
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MEEHGKKEHEGTPLGCHSHIKLLKINREHLVTNIRNTQCLVDNLLENGYFSAEDAEIVCA 

CPTKPDKVRKILDLVQSKGEEVSEFFLYVLQQLEDAYVDLRLWLSEIGFSPSQLIRTKTI 

^/NTDPVSRYTQQLRHQLGRDSKFMLCYAQKEDLLLEETYMDTI^MGLVGFNNENLGSLGGL 

DCLLDHSTGVToNEHGETVFVFGDAGVGKSMLLQRLQSLWASGRLTSTAKFFFHFRCRMFS 

CFKESDMLSLQDLLFKHFCYPEQDPEEVFSFLLRFPHTALFTFDGLDELHSDFDLSRVPD 

SCCPWEPAHPLVljIx^LLSGRLLKGAGKLLTARTGVEVPRQLLRKKVLLRGFSPSHL 

ARRMFPERTAQEHLLQQLDANPNLCSLCGVPLFCWIIFRCFQHFQTVFEGSSSQLPDCAV 

TLTDVFLLVTEVHLNRPQPSSLVQRNTRSPAETLRAGWRTLHALGEVAHRGTDKSLFVFG 

QEEVQASKLQEGDLQLGFLRALPDVGPEQGQSYEFFKLTLQAFFTAFFLVADDKVSTREL 

LRFFREWTSPGEATSSSCHSSFFSFQCLGGRSRLGPDPFRNKDHFQFTNLFVCGLIJVKAR 

QKXIJIQLVPKAILRRKRKALWAHLFASLRSYLKSLPRVQSGGFNQVHAMPTFLWMLRCIY 

ETQSQKVGRLAARGISADYLKIAFCNACSADCSALSFVLHHF 

QELQ P CFS RLTVI RLS VNQ I TDTGVKVLCEELTKYKI VTFLGL YNNQ I TD I GARYVAQ I L 
DECRGLKHLKLGKNRI TS EGGKCVALAVKNS TS I VDVGMWGNQ I GDEGAKAFAEALKDHP 
SLTTLSLJIFNGISPEGGKSLAQALKQNT^^ 

LWLIQNRITAKGTAQLARALQKNTAITEICLNGNLIKPEEAiCVFENEKRIICF 
(SEQ ID NO: 43) 

FIG. 16 



24 / 35 



WO 99/40102 



PCT/US99/02544 
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gatcatcgttcactgcagccttgaactcttgtgctcatgtgatcctcctgccttagcctccccaa 
tagctgggactacaggtgcgccaccatgcctggctaattttttttatttttgtagagatgggtgt 
ctcactatgttgcacaggttggtctcaaactactggccttacttcaagctatctacccatctcag 
cctcccaaagcgctgggattacagtcatgagccaacttgcctggccagataaaggtcttaagcat 
ggttccttcctgctctaggtagagaaaccccacaaccagtgggaggtggggtgagctctttctgt 
agcttttgctttgctgatgatgtcattgatctcttcaggggctgcgcagagtagcaggggccctg 
gagggcgcggcctgaatcctgattgcccttctgctgagaggacacacgcagctgaagatgaattt 
gggaaaagtagccgcttgctactttaactatggaagagcagggccacagtgagatggaaataatc 
ccatcagagtctcacccccacattcaattactgaaaagcaatcgggaacttctggtcactcacat 
ccgcaatactcagtgtctggtggacaacttgctgaagaatgactacttctcggccgaagatgcgg 
agattgtgtgtgcctgccccacccagcctgacaaggtgccccggggacagggacgggcatggcat 
tgtgtggaccccgggagctagaagaggcctctccctgctgatctgagtgaagagcgtgggagttt 
agtccagcgggcagggctgcattttggggtactaatagcacacaaatgcctgggttagcaggttg 
cacagtcaggtattttacttctgtgtttgtgtctggagcaaaccctgacatctcagttctcattg 
ctgtgtgtattggttcccagacacttcatttttagatcccctttaaattaggagggaaaaagaac 
ataagcataagagcatccccagcagcgatgttcattcagtgcctctgaaggctggagggctgctt 
gttgctgggtgagactcggaggggaaccgactcagggtcaggaatgatgacatcccacggtgggt 
ccacagtgaagaatcttccccgctccactgtgggacgccttaacagcccttacttccacttacgc 
tttgcgttatctcctgaaaaataaaatggagaccacaaattccttcttggttagaggaatgacac 
aactcatttatgacatgaccccgctgggactcagaagagaccaggacggtttctgggggaagcag 
tagcacactcgtgtgctttgttctcttctcttgatttgttttcccacatttttaacaagaaaaaa 
agccgtttttaatatatggcctatcgccctcctactgtgtggcccaggtgcctacctcattatgc 
ccaaggggtggttctcacctctccactctcattcctgcacagcagttgtgtcaggttaagaggga 
caaggagaaggctgggcaccgtggctcacgcctgtaatcccagcactttgggaggccgaggcagg 
cagatcacctaaggtcaggagtttgagaccagcctggccaacatggggaaaacccgtctctaata 
aaaacacaaaaattagtcgggcatggtggtgggtgcctgtaatcccagccacttgggaggctgag 
gaaagagaattccttgaacctgggaggtggaggttgcagtgagccaagattgtgccattgcactc 
cagccctccagcctgggtgacagagcaagactctgtctcaaaaaagaaaaaaaaaaaaaagaggt 
agagaagtccatggctatttgtctgtcctttttatttttaggctcatggaagcctcctggtttct: 
tagagctgagtggttttatttcttgctcaggaggtcatttcacagattttcgggctccaatatgt 
tgactgtcacagcagctggggggatggcatagctaccggctgtactaagaactcagagccctgcc 
ctgagcctgcctgagggtccttatggtaggaggatgcccctcatgccagcccgtgccctcatgct 
tgtgtcacctccaggtccgcaaaattctggacctggtacagagcaagggcgaggaggtgtccgag 
ttcttcctctacttgctccagcaactcgcagatgcctacgtggacctcaggccttggctgctgga 
gatcggcttctccccttccctgctcactcagagcaaagtcgtggtcaacactgacccaggtagga 
gtcagccccagcaagaccgcaggcaccagtgcaagcagggccctggggggtttggtaatggctgg 
gccagccctgagtgccacctcaggaagcaggcccaggtgctattttgattttagaaaggaacagc 
tgaatcctgtctcccaagtgcagcccaggtggctgcgattgaactgcccacacctcgatggtctg 
gtttatagaggggcctttggaagtatgggaatggcctgtgttctgaccccttgctttcttcctat 
tctgacatatgtagacattttaatggttgcacaaattcaaggttgtatttttttttcttttaaaa 
aaatctttagctggacatggtagcacacacctgtagttccagctactcaggaggctgaggcaaga 
ggactgcttgagccccagagtctaaggctgcagcgagctatgattgtgcccctacactccagcct 
gggtgacagagtgagaccctgtctctaaaaaaggaaagaaaaaaattaaaaagccttgccaggtt 
tgattctaggcaaagtattctgtcaccgttgagtgccagtccttatttccaaactaatggaagac 
cccatcagttaactgattagttcaataagtattttttgctgtatccaccacatgccaagacccta 
cactgtgctggatgtcagggagacagtggtgagcagacacagacagggttcctgccctcagggag 
cttcaagtcagctggaagagaccaccagtcagcaatctcaaaaatgtgtcaggacagcggcagtc 
caaggcatgtgagaacatatcattagggccaggatctgctctggggcaggagtcttctttccctg 
cttttgaactctccactttgagacagctgttggtaacataccagcaccaaggacctaagtcctgc 
cttttaaagaatccaatatgttgttggaaacagaagcacaagacaggtgtgtgcttaggggaaac 
aaggccagccggcagagtgtcagtgctaggctccagcttccacagcccctgcaggtgcctgccag 
ccactgctagcttctgactctgtctgctccttcctgtctccccttgtttccttcccccatgaaaa 
aaaaagaaagtattcccatgaggaatcattctttcgaaagacttctctgttggttccgttagcca 
gctactttactagcttttacagtgtaattcactctacaagcagtctcacacaaaagactacatat 
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tgtatgattctgtttatatgaaatgtccagaaaaggtiaaatctatagacaaagcaaatcagtagt 
tgcctacggcccagggattggctacaaataggctccagaaaactctgggaagatggtagagatgt 
tctagacctggactgtggtgaggtttgcacaactttgtaaacttactaaaaattactgacaaata 
tataacactccctaacactttgggaggccgaggtgggcagatcgcttgaacccaggaatttgaga 
ccagcctgggcaacatggcgagaccccgtctctacaaaaaaacacaaaaattagttgggcttggt 
ggcatatgcctgtgtcccagctacttgggaggctgaggtgggaggattgcttgagcctgggagtt 
tgagactgcatgattgggtcactgcaccctagcctgagtgacagagcaggaccctatctctaaca 
acaaaaaagcagtgttggtggaggagggccagcgtggccatctggcctggccctcgagtgcgagg 
ggcttcagtgtttagctgcagttcagtgatgacactgtgcggaggaataagggtggcctgtctca 
gacactgatcccagctgaagtttgtcaccttctttctggcaaatctgaggtcaagcagagagatc 
aaagcctggggccctcagggtcaggaatgctggctctgtgacgctccccaggtcctgcatctgag 
gagtggctgcgctggcctcagggcccaggttgtgaattttgtttatgcactcgcctctcctcttt 
gagacctccctgtttgatgctgtttctgcctctctcctcaccctgctgctgtgccctgccacccc 
ctccctccagtgagcaggtatacccagcagctgcgacaccatctgggccgtgactccaagttcgt 
gctgtgctatgcccagaaggaggagctgctgctggaggagatctacatggacaccatcatggagc 
tggttggcttcagcaatgagagcctgggcagcctgaacagcctggcctgcctcctggaccacacc 
accggcatcctcaatgagcagggtgagaccatcttcatcctgggtgatgctggggtgggcaagtc 
catgctgctacagcggctgcagagcctctgggccacgggccggctagacgcaggggtcaaattct 
tcttccactttcgctgccgcatgttcagctgcttcaaggaaagtgacaggctgtgtctgcaggac 
ctgctcttcaagcactactgctacccagagcgggaccccgaggaggtgtttgccttcctgctgcg 
cttcccccacgtggccctcttcaccttcgatggcctggacgagctgcactcggacttggacctga 
gccgcgtgcctgacagctcctgcccctgggagcctgcccaccccctggtcttgctggccaacctg 
ctcagtgggaagctgctcaagggggctagcaagctgctcacagcccgcacaggcatcgaggtccc 
gcgccagttcctgcggaagaaggtgcttctccggggcttctcccccagccacctgcgcgcctatg 
ccaggaggatgt tccccgagcgggccctgcaggaccgcctgGtgagccagctggaggccaacccc 
aacctctgcagcctgtgctctgtgcccctcttctgctggatcatcttccggtgcttccagcactt 
ccgtgctgcctttgaaggctcaccacagctgcccgactgcacgatgaccctgacagatgtcttcc 
tcctggtcactgaggtccatctgaacaggatgcagcccagcagcctggtgcagcggaacacacgc 
agcccagtggagaccctccacgccggccgggacactctgtgctcgctggggcaggtggcccaccg 
gggcatggagaagagcctctttgtcttcacccaggaggaggtgcaggcctccgggctgcaggaga 
gagacatgcagctgggcttcctgcgggctttgccggagctgggccccgggggtgaccagcagtcc 
tatgagtttttccacctcaccctccaggccttctttacagccttcttcctcgtgctggacgacag 
99 t 999 cactca 99 a 9 ct: 9 c tcaggttcttccaggagtggatgccccctgcgggggcagcgacca 
cgtcctgctatcctcccttcctcccgttccagtgcctgcagggcagtggtccggcgcgggaagac 
ctcttcaagaacaaggatcacttccagttcaccaacctcttcctgtgcgggctgttgtccaaagc 
caaacagaaactccttcggcatctggtgcccgcggcagccctgaggagaaagcgcaaggccctgt 
gggcacacctgttttccagcctgcggggctacctgaagagcctgccccgcgttcaggtcgaaagc 
ttcaaccaggtgcaggccatgcccacgttcatctggatgctgcgctgcatctacgagacacagag 
ccagaaggtggggcagctggcggccaggggcatctgcgccaactacctcaagctgacctactgca 
acgcctgctcggccgactgcagcgccctctccttcgtcctgcatcacttccccaagcggctggcc 
ctagacctagacaacaacaatctcaacgactacggcgtgcgggagctgcagccctgcttcagccg 
cctcactgttctcaggtgaggctgccaggcaaggggagcaacaggtgggccgggcgggccaggct 
cggagggcatcgggaatggcatcatggaccaggatcccccaggactcatgaccatggcccttgga 
atgtccagaccttttccttcttagcagggcagaggtcaaggtgcaaagcttcgaggcaggtggac 
ctggatcagccacagctgggtgcccttgaacaaagtgcttaactctcagagcctccacgccctca 
tctggaaaaagaagatgctcataatcctatcaattatggccacagggaccaatgttagttgagaa 
tgggtgaagtgcattacaaatattacctaatggaatgctctttacaaccctgtaacttaggtact 
gttattgtctctattttggcagataaggaagtagaggcacagagaagttaatagcttgctttagg 
tcacacagctcagacatagcagtgccagaatgcataaagaaccttccttttaagattaatgtaag 
gctccgagatagccctcaaaaagtttctggaatatgggagcttttattactgcagagaaagcaga 
ccttgtgccagttggcactggtgactttctgtgatcaacgctagcagcccttcacactgctagag 
acctcagttaaaatgctgactcgtggttgttttcctgttccatagtttacgggaaacagagccca 
gtctgttttcttctattagcatttcctatgtaaaataaaccttgtaaatctctacagggggttaa 
atttgccattacttgactcatgcatttctaaaaagcagtagggatttggaactgactcccagtgc 
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ctgtcacaccagtgtcagagtgtaaataattgcatggggacatggggtgcagggggtcgaaggct 
gccctagcctgggaattggaaaacctggagtctgttctctgtactctcagccagtgactctccct 
ctgtagccccaggcagtctcacactcagtgccaccctctgtccatcttttttttttctcccccaa 
atggagtcccgctctgttgcccaggctggagtgcagtggcgtgatctcagctcactgcaacctcc 
gcctcctgggttcaagcgattctcttgccccagcctcctgagtagctgggattacaggcacacgc 
caccatgtccggctaagttttttgtatttttagtaggacggggtttccccatgttggccaggctg 
gtcttgaaatcctgacctcaggtgatccgcccgcctcggccttccaaaatgttggggttacaggc 
atgagccgccgcacccgacccctctgtccatcttttcaatgggaaactccacaccagtgtggtgg 
ccctgcccttcctgctgtccccaggtgaagctttccttcacaccagtgcaagaaaaaacagcttg 
taggaaagcagaggatatgggtaaccacgggaagcacactcagttctctggctgcatcagttagg 
attagttttagctgagagcgaaaaccccaaatgttggtgagttacaagcttatttctctcatgta 
aaagtctagaggtaggtagttcaggactggtatggagtctccatgaccctccggagcccaggctc 
tcttctgccttcctgttctgccatcctcactacccggctttcccatcttggcccaagagggctgc 
tcaaactccagccatctagtcgacactctagctatcagtaagaaggaagggcaaagattgagagc 
atgcctcaatcttttaagaacacttcttggctattactaattatattgctgcttagatttcagaa 
cttaatggtatgggcagaatttaatgagatgggcccagctaaaagatgggggaatctattgctaa 
gaaagtatagatattgggaatgtctagcagcctgtgctgtcttgggctggccatgccatgtacat 
acacactatttcccagcaccaagctggggactctgagggaaagggtccagagtgtctgacttgat 
cat tttgatgtggcctaaaaatcaagcttttaattgttcagccttt tact tgttatcaaggtcag 
cttgtgggtctaattgggcccaaggcttgtgtttctaagtaaagttttattggaacgcagccata 
cccatttatttacttactggctgcttcacactacacagttgagtagctgtgacagagaccacatg 
gcccacagagcctaaaatatttgctgtctgacactttacagaatgacatgagcagtctcctttga 
cagtgggactcacagccttttccagtgacaaatcagggttagcccatgtgtttctggatgggggg 
aagctgttggcattttgggtataacagttcttgtgagacctgtccagcattttgcaggacaccta 
acatcattggccctgcctgcaagatgacagggcactccctcctccagtcacaaccactaaaagca 
gcccctgacatttccaaacccatgccctccaccatacgagaaccaggtacagggtctggctgaca 
cataggtcacacgcaaagggtggatgtcagaggtggctggcctcacacgtcctccctgtgtcctt 
cacggtcgtgtgaggagccaggggctgtgctgcagcctcgctcatgggctggtgcaggatgggtc 
tggcggccccacgttggccaggctttgtaaggggctatttggctgattgctgtggccattctcca 
ggggcgtctatacctgagaaaactccagggcctgaaggcttctggatctttgtaagattaatggt 
ccttcataatgagtgcctgccctgactcgtaatttttttgctgttttatttcagactcagcgtaa 
accagatcactgacggtggggtaaaggtgctaagcgaagagctgaccaaatacaaaattgtgacc 
tatttggggtatgtctttctccagaacactgggccaactacctagtaataatacagagctgcagg 
gaattcacattcccataggtccctggatgatcggcacggatggcccagggctgggaagagcgctg 
gcccaggagttgagagtcctgggttctctttgtggctcggccagtcatgaagtcttgctgagcct 
cagcctcctcacctgtaaaactgggatcccagtataggcaagtaggcttacaactggttattggg 
ggatgcaacgagaatataaggggatatatttaataaatgctagaatcctgtttacatattagtct 
ggactattttgggtccataatccctcatccagagcctttggggcaagacccgaatggggattctg 
agtgcatgctatggcatgacgtggccgcaggggtctaaggcagtgccccattttcaaacactttc 
atatttctcccgcagaatgtatgaaacagtcaaaccaagtgtggtaagaaagactataagtagct 
ccacatcagttgccaaaagaattgtgagaaactttgggcattcagagcctttgaggttttggagt 
ctgagagaagggattgcgggccagccccacacaactggtggctctgcaagctggagcagttgttc 
agtttcttggggcctcagtggccttcgatgttaatgaggacatggacgcaaacgaccccgggcca 
cactcggctccagggctctgtgtggctgtggaaccctggaagcctgagcttagctgcctttcaac 
ttccatctgctgtactattgaattggcattgagcggtgagatggctgaaaggtagacatcgagaa 
gttttaatattcagaatcttttcttctcaagacgctgaatgtaatcttagttgtaaatacccatc 
acctgccagtcaccgagcactcatgcaccagggctttgcgttatgtcctaagatcctcataacca 
ccctgcaaggggactatcatcattacctctgtattacagatggagaaactgaggcacagagaggt 
aacgtgacttgtctcaggccataaagctggggaaagtagtggagctggttttgaacctgagctgt 
gagacctcagagccctaaactctggtgcctgtgtgttcccctttcaacccagactttggaaatca 
gtagacaccatatgcttcaaaaaacaggggctattaaaatgacatcaggagccagaaagtctcat 
ggctgtgctttctcttgaagtttatacaacaaccagatcaccgatgtcggagccaggtacgtcac 
caaaatcctggatgaatgcaaaggcctcacgcatcttaagtaagtggggtaggcaccaggttcct 
tagtatattctcttgatcacccccttctgttgttcaaagattaaatgtcacagtaaagagctttc 
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atcctaaagccttccacttgtcccagggccatgttggtcaagtaaagatacctctgtgtgatctg 
tgaggcttggattctggaagggcctcccgctattggtagggggaaaggttggcatt.ttgatttca 
ttaactactaggccgaagaaaggactaactctcaccctttctggtggtctttttgccccaaggga 
gtttcctgtcgggttgcaaggaagagcttgggcccttgccctgctgtaggtgtgccctgcgcagg 
gggtgacagtgcgccaggcttggagcctctggtcctgccctgacagtggccacataccttgaccc 
ttggcagtcaaagtgggacctcccaggtctcccgagggaagtcagtgatgctgctgaggtcaatt 
agaggaccccagggagggctcaggtccctgagcttctgcagagactgtggaccatctcctggaga 
ggaaccctgactgactgtcctcagggcttcagttccctccctgacaggaggcccaggccatggct 
cttgtggatcccagaagaaagtgtacggttcccaagatggggctggaaggggctctgtgctgggg 
aggagggtgacccacattggagcccctgcatagctggaggctgactgtgtgtgactctctctgca 
gactgggaaaaaacaaaataacaagtgaaggagggaagtatctcgccctggctgtgaagaacagc 
aaatcaatctctgaggttgggtgagtagaaggggatggatgtatgtggtacaacctgctgtgtgt 

9 t 9g9999 c 999 cctt 9 ct 9 ttct ^ ttcatacatca 9 tacacca 9 aa 99 accact 9999 ctc 9 ct 
gtcggggagagatagtggagagctttcaccatgctgcgaaactgaaaccgtgcccattaagcaat 
aactccccggtccccctcccccctgcctcttgcagccaccctgctacttactctctctatggttt 
tgactactctacctcatgtaagtggaatcatacagtatttgccttttggggatggctgatttcac 
tagcatcatgtcctcaagattcgtccacatggaagcatgggacaggatttccttttttttttttt 
ttttttttttttttgacagagtctcgctctgttgcccaggctggagtgcagtggcatgatctcgg 
ctcactgcaacctctgccttctgggttcaagcgattctctcgcctcagccacacgagtagctggg 
attataggcacccgccaccaatcccagctaatttttgtatttttagtagaggcggggtttcacca 
tgttggccaggctggtctcaaactcctgacctcaaatgatccacccacctcggtctcccaaagtg 
tcaggattataggcgtgagccaccgtgccccgccaggatttccttcttttttaaggctgagtaat 
actccattgcatggctatgccacattttgtttactcattcatccaagaacagacactggcttgct 
tctatgctttggctgttgtgaataatgctgctgtgcacatgggcatacaaatgtctcttcaagga 
ctgccttcaattcttttttttttttttttttttttttagattctttttttttttattatactcta 
agttttagggtacatgtgcacattgtgcaggttagttacatatgtatacatgtgccatgctggtg 
cgctgcacccactaatgtgtcatctagcattaggtatatctcccaatgctatccctcccccctcc 
cccgaccccaccacagtccccagagtgtgatattccccttcctgtgtccatgtgatctcattgtt 
caattcccacctatgagtgagaatatgcggtgtttggttttttgttcttgcgatagtttactgag 
aatgatggtttccaatttcatccatgtccctacaaaggatatgaactcatcattttttatggctg 
catagtattccatggtgtatatgtgccacattttcttaatccagtctatcattgttggacatttg 
ggttggttccaagtctttgctattgtgaatagtgccacaataaacatacgtgtgcatgtgtcttt 
atagcagcatgatttatactcatttgggtatatacccagtaatgggatggctgggtcaaatggta 
tttctagttctagatccctgaggaatcgccacactgacttccacaatggttgaactagtttacag 
tcccaccaacagtgtaaaagtgttcctatttctccgcatcctctccagcacctgctgtttcctga 
ctttttaatgattgccattctaactggtgtgagatgatatctcatagtggttttgatttgcattt 
ctctgatggccagtgatgatgagcatttcttcatgtgttttttggctgcataaatgtcttctttt 
gagaagtgtctgttcatgtccttcgcccactttttgatggggttgtttgtttttttcttgtaaat 
ttgtttgagttcattgtagattctggatattagccctttgtcagatgagtaggttgcgaaaattt 
tctcccatgttgtaggttgcctgttcactctgatggtagtttcttttgctgtgcagaagctcttt 
agtttaattagatcccatttgtcaattttgtcttttgttgccattgcttttggtgttttggacat: 
gaagtccttgcccacgcctatgtcctgaatggtaatgcctaggttttcttctagggtttttatgg 
ttttaggtttaacgtttaaatctttaatccatcttgaattgatttttgtataaggtgtaaggaag 
ggatccagtttcagctttctacatatggctagccagttttcccagcaccatttattaaataggga 
atcctttccccattgcttgtttttctcaggtttgtcaaagatcagatagttgtagatatgcggca 
ttatttctgagggctctgttctgttccattgatctatatctctgttttggtaccagtaccatgct 
gttttggttactgtagccttgtagtatagtttgaagtcaggtagtgtgatgcctccagctttgtt 
cttttggcttaggattgacttggcgatgcgggctcttttttggttccatatgaactttaaagtag 
ttttttccaattctgtgaagaaagtcattggtagcttgatggggatggcattgaatctgtaaatt 
accttgggcagtatggccattttcacgatattgattcttcctacccatgagcatggaatgttctt 
ccatttgtttgtgtcctcttttatttccttgagcagtggtttgtagttctccttgaagaggtcct 
tcacatcccttgtaagttggattcctaggtattttattctctttgaagcaattgtgaatgggagt 
tcacccatgatttggctctctgtttgtctgttgttggtgtataagaatgcttgtgatttttgtac 
attgattttgtatcctgagactttgctgaagttgcttatcagcttaaggagattttgggctgaga 
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cgatggggttttctagataaacaatcatgtcgtctgcaaacagggacaatttgacttcctctttt 
cctaattgaataccctttatttccttctcctgcctgattgccctggccagaacttccaacactat 
gttgaataggagcggtgagagagggcatccctgtcttgtgccagttttcaaagggaatgctitcca 
gtttttgcccattcagtatgatattggctgtgggtttgtcatagatagctcttattattttgaaa 
tacgtcccatcaatacctaatttattgagagtttttagcatgaagggttgttgaattttgtcaaa 
ggctttttctgcatctattgagataatcatgtggtttttgtctttggctctgtttatatgctgga 
ttacatttattgatttgcgtatattgaaccagccttgcatcccagggatgaagcccacttgatca 
tggtggataagctttttgatgtgctgctggattcggtttgccagtattttattgaggatttttgc 
atcaatgttcatcaaggatattggtctaaaattctcttttttggttgtgtctctgcccggctttg 
gtatcagaatgatgctggcctcataaaatgagttagggaggattccctctttttctattgattgg 
aatagtttcagaaggaatggtaccagttcctccttgtacctctggtagaattcggctgtgaatcc 
atctggtcctggactctttttggttggtaaactattgattattgccacaatttcagagcctgtta 
ttggtctattcagagattcaacttcttcctggtttagtcttgggagagtgtatgtgtcgaggaat 
gtatccatttcttctagattttctagtttatttgcgtagaggtgtttgtagtattctctgatggt 
agtttgtatttctgtgggatcggtggtgatatcccctttatcattttttattgtgtctatttgat 
tcttctctctttttttctttattagtcttgctagcggtctatcaattttgttgatcctttcgaaa 
aaccagctcctggattcattgattttttgaagggttttttgtgtctctatttccttcagttctgc 
tctgattttagttatttcttgccttctgctagcttttgaatgtgtttgctcttgcttttctagtt 
cttttaattgtgatgttagggtgtcaattttggatctttcctgctttctcttgtaggcatttagt: 
gctataaatttccctctacacactgctttgaatgcgtcccagagattctggtatgtggtgtcttt: 
gttctcgttggtttcaaagaacatctttatttctgccttcatttcgttatgtacccagtagtcat 
tcaggagcaggttgttcagtttccatgtagttgagcggctttgagtgagattcttaatcctgagt 
tctagtttgattgcactgtggtctgagagatagtttgttataatttctgttcttttacatttgct 
gaggagagctttacttccaactatgtggtcaattttggaataggtgtggtgtggtgctgaaaaaa 
atgtatattctgttgatttggggtggagagttctgtagatgtctattaggtctgcttggtgcaga 
gctgagttcaattcctgggtatccttgttgactttctgtctcattgatctgtctaatgttgacag 
tggggtgttaaagtctcccattattaatgtgtgggagtctaagtctctttgtaggtcactgagga 
cttgctttatgaatctgggtgctcctgtattgggtgcataaatatttaggatagttagctcctct 
tgttgaattgatccctttaccattatgtaatggccttctttgtctcttttgatctttgttggttt 
aaagtctgttttatcagagactaggattgcaacccctgcctttttttgttttccattggcttggt 
agatcttcctccatccttttattttgagcctatgtgtgtctctgcacgtgagatgggtttcctga 
atacagcacactgatgggtcttgactctttatccaacttgccagtctgtgtcttttaattgcaga 
atttagtccatttatatttaaagttaatattgttatgtgtgaatttgatcctgtcattatgatgt 
tagctggcgattttgctcattagttgatgcagtttcttcctagtctcgatggtctttacattttg 
gcatgattttgcagcggctggtaccggttgttcctttccatgtttaccgcttccttcaggagctc 
ttttagggcaggcctggtggtgacaaaatctctcagcatttgcttgtctataaagtattttattt 
ctccttcacttatgaagcttagtttggctggatatgaaattctgggttgaaaattcttttcttta 
agaatgttgaatattggcccccactctcttctggcttgtagggtttctgccgagagatccgctgt 
tagtctgatgggctttcctttgagggtaacccgaactttctctctggctgcccttaacatttttt 
ccttcatttcaactttggtgaatctgacaattatgtgtcttggagttgctcttctcgaggagtat 
ctttgtggcgttctctgtatttcctgaatctgaacgttggcctgccttgctagattggggaagtt: 
ctcctggataatatcctgcagagtgttttccaacttggttccattctccacatcactttcaggta 
caccaatcagacgtagatttggtcttttcacatagtcccatatttcttggaggctttgctcattt 
ctttttattcttttttctctaaacttcccttctcgcttcatttcattcatttcatcttccattgc 
tgataccctttcttccagttgatcgcatcggctcctgaggcttctgcattcttcacgtagttctc 
gagccttggttttcagctccatcagctcctttaagcacttctctgtattggttattctagttata 
cattcttctaaatttttttcaaagttttcaacttctttgcctttggtttgaatgtcctcccgtag 
ctcagagtaatttgatcgtctgaagccttcttctctcagctcgtcaaaatcattctccatccagc 
tttgttctgttgctggtgaggaactgcgttcctttggaggaggagaggcgctctgcgttttagag 
tttccagtttttctgttctgttttttccccatctttgtggttttatctacttttggtctttgatg 
atggtgatgtacagatgggttttcagtgtagatgtcctttctggttgttagttttccttctaaca 
gacaggaccctcagctgcaggtctgttggaataccctgccgtgtgaggtgtcagtgtgcctctgc 
tggggggtgcctcccagttaggctgctcgggggtcaggggtcagggacccacttgaggaggcagt 
ctgcccgttctcagatctccagctgcgtgctgggagaaccactgctctcttcaaagctgtcagac 
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aggqacacttaagtctgcaqaggttactqctgtctttttgtttgtctgtgccctgcccccagagg 
t 99 a 9 cct:aca 9 a 9g ca 99 ca 99cctccttgagctgtggtgggct:ccacccagttcgagcttccc 
99 ct 9 ct:t:t 9 tt:t:acct:aa 9 caa 9 cct: g99ctatggcgggcgcccctcccccagcctcgttgccg 
ccttgcagtttgatctcagactgctgtgctagcaatcagcgagattccgtgggcgtaggaccctc 
tgagccaggtgtgggatatagtctcgtggtgcgccgtttcttaagccggtctgaaaagcgcaata 
ttcgggtgggagtgacccgatt t tccaggtgcgcccgtcacccctttctttgactcggaaaggga 
actccctgatcccttgcgcttcccaggtgaggcaatgcctcgccctgcttcggctcgcgcacggt 
gcgcgcacacactggcctgcgcccactgtctggcgctccctagtgagatgaacccggtacctcag 
atggaaatgcagaaatcacccgtcttctgcgtcgctcacgctgggagctgtagaccggagctgtt 
cctattcggccatcttggctcctccctccaattcttttgggtatatatccagcagtgggattgct 
ggatcacatggtaatttttaattttttgaagaatcatcatactgttttccacggcagcagcacca 
ttttatgttcccaccaacagttcattctagtttctccacatccttgccaacacttgctattttct 
ctttttgacagtacccatcctaatgagtgtgaggtcctgtctcattgtggttttgattcttgagg 
ctttttaaagctttttgtttcat tataatt tttattggat t acaaaaggaacacaggt aatt tta 
tttggaaactatgaaaaataataaaaattatcttctcagaaaatgattcttgttaacatttaagc 
tcagttaagctctctcactttctctcccttctctctctttgtacaacttttaaaaaatatagtag 
gggtgagactatatgtatctatactatagtaggggtgagactatatgtatccttcctttttcact 
taatctcatgccttgagtagctttccactttattaaaaatgtgatgccattcaattgtatagtaa 
atacatatatgtaagcaaaacactgaaaactcttattctgggttccagcaagccatacctggaat 
ggtgtaagcaggtagtttgcttggtgtgaacgtgttgttgaggcagctgccattgtgttgtgagt 
gggccacacgaacttgttctgttgtgtgtagacagtgtgtgctgatcctattaggaacagccaac 
gctttgtgtgagccacacacggttctaagtgctttgcttctgttaactcagtgaatcctcacaac 
tccatgacggaatgctctaattatccccattttatagatggggcaactgaggtccaagagactac 
ataatttcccgaagttcacacaggtagcagatggcagagccgggtcaggagtccaccatcttacc 
acgcagactgttttagccagagactctccggatctgctgtaggggacagaatacagctttatcgc 
cgcacctgtccaccaagatggccgtagccacagagcttggttgggtaacgtcctctttatgtgac 
aggaacgttgctgatggggtttctgaaggtacttcctgctctttgtctcctggaagactgtgtct 
tcaggaatgtctctgaccctgcccagagttgaacggatgctgggaacccagcacctgcacacggc 
cttccctccaggactctgcgcacctctgtgctccacaggagacatgcaggtgctttctctcatga 
gctcaggctcctgggctgacagctctccgaagctcgtggtgaggctcggtctctaactgtgccac 
ttgccgatggcctctgttcacaaggcttcccctgctcttcgatcttgcatcaccccttgaatttg 
aaatccagagcagcccactcagagaccagtgtgaggaattagtgtccaggccacagatccaggga 
ctgggcacaaacatctgcctgttgagtaggaactgagctgtggccattggcaaaaaaggaggggt 
gagcatggctgtttcttggggagctaacattcactatcttgtctcctccctcaggatgtggggca 
atcaagttggggatgaaggagcaaaagccttcgcagaggctctgcggaaccaccccagcttgacc 
accctgaggtaactgtggccctgctgtctccaggggccaacctggtccctcccagctgctctagg 
tttgctggggaagggtgattcgtgctcctaatagaagaggaatttgcatgtgtgattttccttac 
tcttgtcaaacctttctttgatgcataagaggccatctagtaaagcacattcttctcttttttta 
actttaagttctgggatacatgtagaagatgtgcaggtttgttacataggcaaatgcatgccatg 
gtgatttgctgcacctatcaacctgtcatctaggttttaagccctgcatgcattaggtatttgtc 
ctaatgcttgccctccccttgccccccacccccaacaggccctggtgtgtgttgttcccctccat 
gtgtccatgtgttctcattgttcaactcccacttacgagtgagaacatgcagtgtttggtttttt 
gttcctgtgttagtttgctgagaatgatggtttccagcttcatccatgtgccagcaaaggacatg 
atctcatttttttttatggttgcatagtattccatagtgtgtatgtgccacattttctttatcca 
gtctatcactgatgggcatttgggttggttccaagtctttgctattgtaaatagtgctacaataa 
acatacatgtgcttgtgtctttatagcagaatgatttataatcctttgggtaaatacccagtaat 
gggattgctgggtcaaatggtatttctggttctagatccctgaggaatcaccttaagtgtttatt 
cagctcagtgaattctgcatgtgtcccacaccagccaaccaccacccccatcaagacagaggaca 
tttccagcccctcagccatccctgcatgtcccttgctggtagagggagggtttcctaagtgcaga 
tgaaacttaataagatgctggccagcagattcctgccccttccttgtcctcaggatgatgctgga 
aaagagggactcttcctctctataaatggggatgcacctacccagcccccgcttaggctgctggc 
caaatcttgggaccttggtatgtccacggctctgctgctgttcttcctaccactgaaaaagagtc 
caagaaggtggggacagtagcagaagagactttgccaggtcttgcagatggggtaccttgatggg 
gccagcctttagaaggacagcttgccaggcctcgccagcctcctgcccatgtgcagaaacctgag 
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gtgccgaccccagcccactgttgtgtgagcaggctgtgctgatgacccatttcccgtccagcctg 
cccttgtgctctgtgtgtgggctctggggcagcagcgcctgggcactactgctgcagctgaacac 
ttctgcatcctgccccgagtgagcctgggctggggccacagccaggcagaggcttcccagctgtt 
ctgatgttgaagctaagattgaatgtagatgtgtctttaataattcaccccaagtgtgttccttc 
ctagtcttgcgtccaacggcatctccacagaaggaggaaagagccttgcgagggccctgcagcag 
aacacgtctctagaaatactgtggtaatagctcgagtcatttcatttgtttgtttgtttttctgt 
gatagggtcttgctttgtcgtccaggcttgagtgcattggtgtgatctcagctcactgcagcctc 
cacctcccaggctcattcgaacctcccgccttggccttccgagtcctgagactataggcatgcac 
caccacacccagttaattttaaaattttttgtagagatggggttttgctatgttacccaggctgg 
tcttgaactcctgggctcaagcagttctcctgccctggcttctcaaagccctgggattgcaggtg 
tgagccactgcacctggcacagagtcattttggagggtttaggtcccaggaattatcccaggggc 
tgcacatggcctggaatcttaacagaaaaggtgtctcccaattggaaaggctctaggcctttcag 
ttaagttgataatttcctcctagagaagagaatagccacttctacaagcataaacaggtacagga 
ggaggaagtgggctccgggagcctggatctgaggccttggccttctaggccccaggagaactaga 
acgctggccatgcaagctatccaggtatccttggataccttcagatgtgcttagcagaggccaac 
ttccacacacttggctcaaaattttctcccttcctcctcttcatctgccttcccccaggcagcct 
cctccttccccaggtcttcacatcagggtttggcctttatgctccatccagctcatctgtcactt 
gtcacctgaagcccacagtcctcgctccctctctgcactctagggcacttactaagtggatgtgg 
cctcctgagagtgttttttgttggtgttcccttttttatggccacttaatgttttattttgcttt 
atttgtatttacatctctgtatcataaattccatacaggtggctgggagcagtgactcacatctg 
taatcccagtactttggaaggctgaggtgggaggatcgcttgaggccaagagttcgagactagcc 
tgggcaatatagcgagaccctctatctacaaaaaaaaaaaacattccttacaggttaagtgaggg 
agttgtattacaaccctccctatcatctactcagagcccagtgctcatttgatcttgctaaatta 
gttactgagaataatgacaatatcctcttcatgagagagttttgacattaggcctgctgtccagt 
aagtgcattttaaattctttcccctcaacaaatcatttaacattttgaaaagtagtttatgtttt 
ttggaaaaaatgtaagacactaaaggaggacatgaaagtacctcctaaagttcctgctaaaagga 
ggaagtgaaagtacctccctttgtgttttccaaaataacctttcctttctagccttttgttctat 
gtatgttcaaagatatgcaaaacagaatagcattcaagcagtggctctaaaaatattgtaatcac 
atactttacatgtctcctttagggtttctccatcttgatgctgttgacattttggtccaagtgat 
tctttattatggtagggctgtcctgtgcatcatagacggtttagccgcatctctgccctgtacct 
cccagtggtgaggatcaaaaatgcctccggacatggccaggtgccccatggagagtgaaatcaca 
tggatagtagtaatgtcaacacctagaagccctcaagtgctgactgcatgccatgtgttattcta 
cactttttccctgtgttaactcactcagcctcacaaccactctatacgatctctactgttaacgt 
tcaccagtgagaaaactcagacccaaagaacttaagcctgttgcccgaggtcaccctgctggtgg 
gtgatacaaacctgcccaggctgagtccggagtagatgtcaatgctgtgttcttctccctcctca 
ttctacctcattctccctacaagctgcacaacatctcgaatagatatcacaatatatttcatcag 
ttgtttctgatctaaatttgttcagattttacattaggataataccacaatgcatgctgcaatgt 
ataaagctttgtgtgtatatccttgcacactgtagggtaaatttctagaagtctgattgtcttaa 
aatgaagcacattaaaaatttgggcaggcacatccaaactgcccttcaaggaattttttttttta 
aatgttctttctgttctattcttcttcctaatgattctttcgtccactggcacaagtgggtccta 
ccctgtttacaccaaggagctttggtgctttatccagaccacttctggttctaaggaccattgag 
agacttcctgaactttcagtcacttaacttgggtccctcacaagttaactgagagcaaagtactg 
aacacattttaatgtgcagtcagtgactgtttcaggtcttcaaactaacttggataacacactgt 
cagtggtgttcaagggaccctgggactagaggagaactgagaagcaggcattggccctttgtttt 
ccgtgggcccccatcttccatgaaatctgagggctcagcaaaggtggggagggagggtgggctcc 
tctacaggtagctgggctaagaaataggagcccaggtacaggatttgcattzaaaaatgagtccca 
ttgaccttctgtggggctgacaggctgggcttggagcctggctgttttctgggttctcagcaagt 
gatcatctgcatagctggagagccttgggctgagctcccgctcctgtgaactctaaaacaatgtc 
tgccaagtaggctctcttgagtaaatacttccttttttttccttaggctgacccaaaatgaactc 
aacgatgaagtggcagagagtttggcagaaatgttgaaagtcaaccagacgttaaagcatttatg 
gtaactcagagagccttacaatttcagactgtgctacttttcaaaagtattttttgagataaaat 
ttacatactgtaaaattcactctcttaaagtatacaattcagaggtttttagtgcaaccatcacc 
acctaattctagaacattttcactcctcctccccactccaaaaagccctggtatccattaagcag 
tcactccctgtcctcctccccagaccctggcaaccactaatccgctttctgtctctatggatttg 
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cctactctgggcatttcatataaatggaatcaagcaatatgtgaccttttgtctctgtgttctag 

catgtttcattcctttttatggctaaatgataattcactctaaggaaattttgcagtttattaat 

cagttgatgggacatttgggttgtttctactttttgactattatgcgtaatgctactgtgaacac 

tcctgttcatgcttttgggtgaacatatgttttcatctcttttgggaatatacctgggaatagaa 

tttctgggtcatatggcaattctgtaactttttgaggagccaccaaactgttttctaaagtggat 

gtactattttacattctcgccagcaatgtatgtggattccaatttctccacatcctcaccaacac 

ttattattgtccatcttttaaaatctagttatactagtggatgtgaagtaatattgtggttttga 

tctgcatttccctgatgacaacaatgttgaatgtctttttatgtgcctactgggagtctgtatag 

cttctttggagaaatgtctccatatcctttgcccattttaaaattgggtttgtcttctaatgctg 

agttataggggttctctatatattctgggtgctagacctttactagatacaggttttgcaagtat 

tttctttctttctgtggagtttttcctctttcttgatagtgacctttaaaggacaacagttttta 

atttttgttttttttgagatggagtcttgctcttgtcacccagacaggagtgcagtggcatgatc 

tcagctctctgcaacctccacctcctgggttcaagcgattcttctgcctcagcctcctgagtagt 

tgggattacaggcatcagccaccatgcctgtctcattttgtatttttaatagagatggggtttca 

ccatttaggcccaggctggtcttgaactcctgacctcaggtgatccacctgcctcagcctcccaa 

agtgctgggattacaggcgaaaagccactgcacctggccaatagtttttaattttgatgaagtcc 

aatttatctatttttttctttggttgcttgtgctttcagtgtcttatctaagaaatgattgccta 

atccaagatcacaaagaactccacctaagttttctgttaagcgttatagttgtttcccctcacat 

ataggtctgcaatccattttgagttaatttttgtatagtgtaaagtgagggttaacctcattctc 

ttgcacgtggatatccagctgtcccggcagcaccacgtgttgaacagattatcttttcctattga 

atggccttgacacccttgtcaaaaatcaattgaccataaatgtatgggtttatttctgaattctc 

tgttctggtccattgatttatatgtctctcctatgccaggaccattgctgtagctttgtgtagta 

cattttgaaatcaggaggtgtgagttctactttgttcttctttctcaagattgtttagaccattc 

tgggttctttgcatttcttatgaattcagactcaccttgtcaatttctgcaaaaagactagactc 

tgctacatattgttttttctttcctttttagcctgcagaattatttgatcccattccctaagtgc 

aggccagcctctccagggagagcagagctaggacagggtcagaaagagagtcttggctgctttgt 

gcatt cccaacc tgcactggccctagtgaaggcagcccgagtgggtggatgtgcctggacactgc 

aggctttttaggggcattaggtgctctccttcctggcctcctgccacatcttggttggaggctgc 

cttccctgccttcaaaaaagcctaagtggtgactagaaaacagcagagtgtaactgaatacagaa 

cttggtgcccacttcctggttctatttttgtcccttttgaaagggaaggtcattacctctgccat 

tgaacccaggggccctagcccttgtggggtatggctgggagcaccagatcctggctgcagcccag 

ccaccagtggtcctgtgtgcttgggcagtaacagtgacaagagctcccttccccctggacactgt 

gcctaataccctcctcttgaaatctcacacacccagtggatggggggcactcttatagttattct 

cagtttacagatgacacaactgaggcacagacagatgcgtttatttcttcaaggttctgtagctg 

aacagtggggagggagggtttaagaggagctgcacccgctctgcaatactgcctctcacgaggga 

gtcctcttcattcatgacagcatagggccctcgtcttcctggtaagggctt-ccttcttgggtcag 

tgccaggatttctaagggtcatgtttagcaggagcctattctacaaacagccaggagcagggaat 

gactctgtgatgaagcggagacactacagcctcttgatgcatttatttcctggttgggttagaag 

cgtagctgcccaagggagcatttcaggagaggcctggcttcctagcgatagctgaaaactttgtt 

tcatttgaatcactgctacccagaacaatggggtgcattctcagagtccccattattaaagcttt 

tccactgagccccatgagaactattcatgagaactatttcatggcagcataactgtttctcctcc 

ctccctcttgcatgttggtagcctcttaactttaaaacctgccttgcctttccctagctacctgg 

aaggagacgtcagacttcctgtcccatggtgtgtttcttacaatttgttgttcagattggtggtc 

tcccaaatatatataaaaatataaatggagtctcactctgtcacccaggctggagtgcagtggca 

cgatcttggctcactgcaacctccacctcccagttcaagcaattctcctacctcagtctcccgag 

tagctgggagtacaggtgcacaccaccatgcccagctaatttttttgtatttttaatagagacag 

gttggccaggatggtatcgatctcctgagctcgtgatccacccacctcggcctcc^ 

ggattacaggtgtgagccactgcacccggccccaaatattttgattatgcacctctgcagtgaaa 

aatgcaaacacacacatcagttcatgtattacattatgttcactataaaaacaaacagaaaattt 

aaaaaatatcaagctatcctttactctagtggatcttacctggacacttttagccagatacaaag 

tcacatggactcagttcttcccctgaccaacttgtctcttatcccaaaacacccttgcaactccc 

ttacgaaggggtcaaatttgatccagtattatggattttatacaagttatgttcttctttcaggc 

ttatccagaatcagatcacagctaaggggactgcccagctggcagatgcgttacagagcaacact 

ggcataacagagatttggtaagatcccagcgtttgtcacagtaataacaccagtgactgtttact 
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caccaccactgactgtgcaaggcacaacgcagggtggtttctgtttattcctccagcaaccctgc 

acagtaatggtattacctctgttttacagaggtagacagaggcccagaccagtgaaataaggttg 

cccaaggtcactacgagagaagctagaattcagcccagaatgcctgattccatattctgtgctct 

cctgccctgggcccccgccctcatctaccttcattgggtgggatgggggaagtggccagtgaaat 

gatttcctagtggaagtaaatccccctgggactcagcaattgagagatgactgtgttggccagga 

gtttggagctcattcttccccttttctgggttccgtaagacatttccaggctgacttgaactgac 

ctgtgctctttgtctacttcttttttctgctttgagaacttccttatgctaatagaagaaaaaaa 

gtttgctttactgtgacattgagcgccatgccacttctttcttgcctcccataaggcacagacac 

tccccactcagcagctcccttaacaacttaattgcctgggtgacgtgggactgggtggatgctgg 

gagaggggccttattaactatgtcctcctttcatgactggggagaatttcatagccaattaaaaa 

aaaacaaaaaacagctccttggccaacacaggctcctcatacagtgttttttaaactttgcttta 

gaacttgtttggaacttgtcataaaatcgatcagtttggtgaattgcaaccaacaatatttaaaa 

agaaaacagaacagaacaaaatatcaggatgcaatgtgcatggtatgaaagtatcatttcattca 

tcttagttcatgcttgcatgtgagtgggtgtgtgtttgcataagtgttggttcacaacataaaat 

gtaattcttatttagggttgtagacaaaaggtttttttttaaaaaaaacactgttggctaggcat 

ggtggctcatgcctgtaataccagcactttgggaggccaagatgggcagatctcttgagcacagg 

agtttgagaccagcctgggcaacatgcgaaaccccgtcactacaaaaattagcccgacatggtgc 

tatgtgcctgtagtcccagctactcaggaagctgatgtgggaggatggatgcatgggagatcaag 

gctgcagtgagccaggatcatgccactgcaatccagcctgggtgccagagaccctgtctcaaaaa 

acaaaaaagaaaaaaagaaaaacaccatcatagagaatagagcccagatctaaacagacacctgt 

ggcctgtgtgcctgcgaagcccagcctgcccagcagcctgggaagcactggagggcactggaact 

gtttgcatgggtgtttgccctcaggccactccgtttctgctgattcttaagttttgaggacagca 

ggcagagggggagaggaaggagactgccagactacagaacagtttgcagagcacagttggcttcc 

acttttctctgtagctggtcaggcgggtagtaaagacctacagttgctttaattctgtcaagttt 

caaaatctgcattgcttccctcttgagggtcaccattcctacacaaggaaccattttagtagggc 

caggagacttcagcttcaaggcctgcacttgtgtcagggtggagaggggaactggccaccaattc 

agagagggcaggacaggcggcatgggtgctggtcttgggagtgtcttcacttaggtccctggctt 

gttctgggagcctccagagcatgctcctctgtgtgtgacttcatgggactgggctctgagaaggc 

tgtggctttgttggccctgccagggactgccacaccaggccacagggttgtggttgagctggccg 

gggagccacgttcagggagcagctctgcttggagccaacacttacagagtaagccttctccttgg 

acttgttaactgtactgacacttatttctacctcattcctttctgaaaataacttggaagtctga 

agtcccttgatgagttctgtctttaagaacagaaattagaggtgaacaatgaacactgtaaatta 

cagaaatgtatcccactccagtataacagctttctgtgaggctatctcctccagactgtggctct 

gggagggtggggcctgagtcaaggtcctagggactagtgctgtgtcttcatttattccttgaata 

acgaaacgcttgagcatcagggactgtgctagcaccaaaaatccagtggtgaacaacatggcttc 

atgggttcactgtctagaaagggagaagcacattaaagaaaaaatcatttgcgtaattatttaat 

tacaactgtgatgggtactatcacaaaggggaaggccaagagggaacctgatttagatgaggttg 

cagggaaggcctctctgaggaagcagcacttacactaagccatgaaggatgaataggagctagtc 

agctgaggtgagtattctgcgtagggaacagcatgtgcaaagggtctggggcaggagggagtgtg 

gtgtcctggaagaactgccagaagctgctgtgccccagggttcagacagtgtggaagaggggact 

acaggaggctgaggagataggcagggactggaccataaaagatctgtgggtcatgatgtgcattt 

tggtctttatcctaaaagtgatggaaagtcagtgaacagtttgaagcaggagaggcatgtgatca 

gatctgcaatgcaaaaagaccaattcttggctcttctaggaaactgaattggagaaggccagagt 

acgtggaaatgacctgtcagtaggacattgtactgatgcagggaagagatgatgggtgctcagac 

caagatggccggccaaagacatagaggttccagggaggcattctagattcttaggaattagggga 

gaactttgtgatacaaggaacatggggatgagaaggaaggtgtccaggttgacccccaggttact 

aacctgctcagcaggatgagagtggtccattcactaagccaggggaccctaggaggtgtggctac 

tttgaggtgtgggggagaggtccaagtgaggatgccaagcaggtaactgcctccacggacataca 

aacaaggccgtggcattgatgagatcgggtggggaaaagggcttagccccaaacctggaggaaat 

ctcagatgtagaggtcacatggaggagaatataggaaaggaaattgaagtagagtgctcagatgc 

aggagaaaaatcagcgcatataaccaagccaaggggagggagtgcctcaagaaggagggagagga 

gaggtcaggacagccaaaatcctgagggccaagaaagacaagacctggaaaatgtcattaaattc 

aggcttatggaggctacaggtgaccttagtgagacccagtgaacagagggatggcagctggagag 

gatccatgctaatatgaaggaactatctgcaaagggtatgttccttaatttcagggatacatgtg 
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tattgtgtgatacacgagtgtgtgctatgaacacaccttgggaaggagtgtgcgaggatccttaa 
cattttacctgtgtacttttgccttcctccttttcaacagcctaaatggaaacctgataaaacca 
gaggaggccaaagtctatgaagatgagaagcggattatctgtttctgagaggatgctttcctgtt 
catggggtttttgccctggagcctcagcagcaaatgccactctgggcagtcttttgtgtcagtgt 
cttaaaggggcctgcgcaggcgggactatcaggagtccactgcctccatgatgcaagccagcttc 
ctgtgcagaaggtctggtcggcaaactccctaagtacccgctacaattctgcagaaaaagaatgt 
gtcttgcgagctgttgtagttacagtaaatacactgtgaagagactttattgcctattataa 
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